Use of an &#34;immunodeficiency-virus suppressing lymphokine (ISL) &#34;to inhibit the replication of viruses, in particular of retroviruses

ABSTRACT

An isolated polypeptide which inhibits the replication of retroviruses in peripheral blood lymphocytes and i) is encoded by the DNA sequence shown in SEQ ID NO: 1 or complementary sequences; or ii) is encoded by DNA sequences which are degenerate to those of SEQ ID NO: 1 or its complement, and nucleic acids encoding said polypeptide. The protein is useful for inhibition of viral replication.

CROSS REFERENCE TO RELATED APPLICATION

This application is a continuation-in-part of International Application PCT/EP96/01486, filed Apr. 4, 1996, and designating the U.S.

The present invention concerns the use of an "immunodeficiency-virus suppressing. lymphokine (ISL)" to inhibit the replication of viruses, therapeutic compositions containing ISL or nucleic acid molecules coding therefor.

ISL activity is defined by inhibition of HIV replication on primary lymphocytes (PBMC).

It is known that certain CD8⁺ cells, e.g., those of human and animal origin which, in addition to being CD8⁺ are HLA-DR⁺, CD28⁺, or CD11B⁻, show activity in suppressing immunodeficiency viruses, such as HIV and SIV. This activity has been attributed to a molecule referred to as immunodeficiency virus suppressing lymphokine or "ISL". ISL is capable of inhibiting the replication of viruses in CD4⁺ cells that are infected with HIV or SIV (Ennen, Findkelee, Kurth et al (Proc. Natl. Acad. Sci. USA, Vol. 91, p. 7207-7211 (1994). However, the identity of ISL has to date been unclear. It has been known since at least 1989 (Walker, C. M., and Leighly, J. A., Immunology, Vol. 66, p. 628-630 (1989)) that there exists a soluble factor secreted by stimulated human CD8⁺ T lymphocytes that down-regulates HIV replication in CD4⁺ T cells. However, it has until now been impossible to establish whether this activity results from a single substance, and it has also until today even been impossible to isolate and characterize a substance with this activity although there exist a lot of publications in which methods of setting up corresponding cell cultures are described and methods of purifying such antiviral factors are suggested. Such publications are, e.g., WO 94/23058 and WO 93/0883 as well as by Mackewicz et al (Lancet, 344, p. 1671-1673 (1994)); Mackewicz et al (AIDS Research and Human Retroviruses, Vol. 8, No. 6, p. 1039-1050 (1992); Castro, Walker et al (Cellular Immunology 132, p. 246-255, (1991); Blackbourn et al (Journal of Medical Primatology No. 23, p. 343-354 (1994); Chen et al (AIDS Research and Human Retroviruses Vol. 9, No. 11, p. 1079-1086)); Kannagi et al (The Journal of Immunology, Vol. 140, No. 7, p. 2237-2242 (1988)); Joag et al (Virology 200, p. 436-446 (1994)); Walker, Moody et al (Science, Vol. 234, p. 1563-1566 (1986)); Walker, Erickson et al (Journal of Virology, Vol. 65, No. 11, p. 5921-5927 (1991)); Walker, Thomson-Honnebier et al (Cellular Immunology 137, p. 420-428 (1991)); Knuchel, Bednarik et al (Journal of Acquired Immune Deficiency Syndromes, No. 7, p. 438-446 (1994)); Ennen, Findeklee, Kurth et al (Proc. Natl. Acad. Sci. USA, Vol. 91, p. 7207-7211 (1994)) and Hsueh, Walker et al (Cellular Immunology 159, p. 271-279 (1994)).

In spite of the above-mentioned intensive investigation that has been carried out on the immunodeficiency virus-suppressing activities produced from CD8⁺ cells for about seven years, the biological nature (especially the molecular structure of ISL), apart from the assumption that ISL is a protein, is completely unclear. Also unclear are:

its gene or genes if it consists of several factors;

the way in which it acts on infected and non-infected CD4⁺ cells. There are preliminary indications that the action of ISL is based on a negative regulation of the transcription rate of HIV-LTR (long terminal repeat);

its mechanism of action in other infections. It may be assumed that viruses whose transcription is regulated by transcription factors that are comparable to those of HIV will also be subjected to a negative regulation by ISL;

its mechanism of action on normal and malignant cell proliferation. According to the present state of knowledge it cannot be ruled out that, similar to the interferons, an inhibitory effect on normal or malignant cell proliferation may be possible;

the fact whether ISL could regulate CD4 expression;

why in the case of HIV-infected patients a decrease in the ISL activity that is measurable in vitro occurs over time;

the fact whether ISL may also be partly responsible for the long latency period between infection and development of disease in humans infected with HIV, i.e., be positively correlated with a positive prognosis.

The problem has therefore arisen of identifying ISL and to clarify whether it represents one or several substances as well as to examine it with regard to its therapeutic action on immunodeficiency viruses and other viruses.

Cruikshank and Center (Journal of Immunology 128 (1982) 2569-2574) describe a protein called "lymphocyte chemoattractant factor" (LCF), which has a sequence quite similar to the sequence of the polypeptides of the invention. It is expressed by human lymphocytes and is a member of the group of lymphokines. After appropriate purification by gel filtration a homogeneous product was obtained with a molecular weight of approximately 56,200 which is cleaved by sodium dodecyl sulphate into monomers with a molecular weight of ca. 14,400. It was assumed that this lymphokine played a role in the formation and amplification of the delayed type of immune response (delayed type hypersensitivity reaction).

The nucleic acid sequence of LCF is described by Cruikshank, W., et al., Proc. Nat. Acad. Sci. USA 91 (1994) 5109-5113. The nucleotide sequence and the protein sequence derived therefrom are available under Accession Number M90391 at GenBank data base and are shown in SEQ ID NO:3 and SEQ ID NO:4.

From Cruikshank et al (Journal of Immunology 138, 3817-3823 (1987)) it is also known that LCF stimulates the expression of interleukin 2 (IL2) receptors and HLA-DR antigens on CD4⁺ lymphocytes. LCF is therefore also referred to as growth factor. Furthermore Cruikshank et al described in the Journal of Immunology 146, 2928-2934 (1991) that LCF induces CD4-dependent intracytoplasmic signals in lymphocytes and thus concluded that these signals act as a second type of messengers. In the J. Exp. Med. 173, p. 1521-1528 (1991) Rand, Cruikshank et al additionally describe the stimulation of human eosinophils by LCF and its massive production by activated T-lymphocytes. Finally in Proc. Natl. Acad. Sci. USA, Vol. 91, p. 5109-5113 (1994) Cruikshank et al described a cloning of LCF by isolating the LCF cDNA from an expression library from mitogen-stimulated mononuclear blood cells (PBMC: peripheral blood mononuclear cells) and introduction into E. coli to produce biologically active recombinant LCF protein (rLCF). Recombinant LCF shows an isoelectric point of 9.0 (Center, D. M., et al., J. Lab. Clin. Med. 125 (1995) 167-171).

Cruikshank W., et al. (Proc. Nat. Acad. Sci. USA 91 (1994) 5109-5113) describes that LCF may contribute to recruitment of eosinophils and CD4⁺ mononucleic cells concomitantly in intracellular reactions. Cruikshank further suggests that LCF activity on CD4⁺ cells would provide a mechanism for the accumulation of non-sensitized T cells in tissue. Its ability to prime CD4⁺ T cells for IL-2 responsiveness might play a role in the specific expansion of this T cell population. In WO 94/28134 the same authors suggest to use LCF as an immunosuppressive agent or as part of an immunosuppressive therapy. However, an antiviral activity of LCF was neither described in nor obvious from these publications. To the contrary, Center, D. M., et al., (1995) (supra) conclude that LCF does amplify the inflammatory process.

SUMMARY OF THE INVENTION

The subject-matter of the invention is the identification and molecular cloning of an immuno-deficiency-virus suppressing lymphokine (ISL) and the isolation of nucleic acid molecules which encode polypeptides with ISL activity. Such polypeptides have improved properties, especially a higher activity than the polypeptide described in WO 94/28134. More specifically, the invention relates to those nucleic acid molecules which encode eukaryotic ISL, including human, monkey and other species. Specifically preferred are nucleic acid molecules which hybridize to SEQ ID NO:1 under stringent conditions as set forth below and code for a polypeptide with ISL activity. It can be shown that natural, synthetic, and recombinantly produced ISL is able to suppress the replication of viruses, especially of retroviruses, in vivo and in vitro.

The nucleotide sequences according to the invention encode a polypeptide that binds to CD4⁺ lymphocytes and can suppress the replication of viruses such as, in particular, HIV-1, HIV-2 and SIV strains. Therefore, such polypeptides, active fragments and derivatives also are a subject-matter of the present invention. The function of ISL is not limited by its presentation in an MHC complex.

A further subject-matter of the invention is the use of ISL for the therapeutic treatment of viral infections, preferably retroviral infections and/or viral-based benign and malignant diseases, and its use for the production of a therapeutic composition containing ISL, as well as its use for the manufacture of such therapeutic agents.

A further subject-matter of the invention is a therapeutic composition containing ISL in an amount effective for treatment of such diseases, especially viral infections. The pharmaceutical composition or agent also contains suitable pharmaceutically compatible carrier substances.

A further subject-matter of the invention is a polyclonal or monclonal anti-ISL antibody or an immuno-active fragment thereof, as well as methods for producing such antibodies and their use for ISL determination and detection of viral infections of eukaryotic cells, especially mammalian samples, preferably derived from mammalian cells.

Another subject-matter of the invention is the use of ISL for the detection of virus-activated mammalian cells, especially T cells.

A further subject-matter of the invention is a method for the determination of soluble or insoluble, free or cell-bound ISL. Such a diagnostic method can be used for the detection of acute or chronic infections, for monitoring the course of viral infections and/or for the monitoring and detection of viral-based benign and malignant diseases.

A further subject-matter of the invention is the use of a nucleotide molecule which can secure expression of ISL in a eukaryotic cell for the activation of ISL in human cells, for in vivo or ex vivo gene therapy.

Another subject-matter of the invention is a therapeutic composition useful in treating a pathological condition characterized by viral replication, especially retroviral replication, comprising at least a substance which activates ISL activity in CD8⁺ T cells, and a pharmaceutically acceptable carrier.

Therefore, a further subject-matter of the invention is a method for the production of a substance and a therapeutic agent for inhibition of the replication of viruses in a patient, said method comprising combining with a pharmaceutically acceptable carrier a therapeutically effective amount of a substance which activates expression of a protein with ISL activity, preferably of a protein with the amino acid sequence shown in SEQ ID NO:2 in CD8⁺ T cells, in vivo and in vitro, to such an extent that viral replication in CD4⁺ cells is inhibited.

A further subject-matter of the invention is a therapeutic composition useful in treating a pathological condition characterized by viral replication, especially retroviral replication, comprising at least a substance which activates ISL activity in CD8⁺ T cells, and a pharmaceutically acceptable carrier.

LEGENDS TO THE FIGURES

FIG. 1 Inhibition of the HIV-1_(SF2) replication on the T cell lymphoma line H9 by purified recombinant ISL (B) and by a cell culture supernatant of activated human CD8⁺ lymphocytes (ISL) (C). A quantitative comparable inhibition of viral replication was measured using the following HIV and SIV strains: HIV-1_(SF2), HIV-1_(SF33), HIV-1_(SF162), HIV-2_(UC3) and SIV_(agm). ((A) comparison, only tissue culture).

FIG. 2 Inhibition of HIV-1_(SF2) replication on primary CD4⁺ lymphocytes by recombinant ISL. LogTCID₅₀ : logarithm of tissue culture infectious dose₅₀ (infectious dose₅₀ of cell culture). Quantitatively comparable inhibitions are seen with the following immunodeficiency virus strains: HIV-1_(SF33), HIV-1_(SF162), HIV-2_(UC3) and SIV_(agm).

FIG. 3 Comparison of DNA and polypeptide sequences of human and monkey ISL. a) human; b) P. troglodytes (chimpanzee); c) M. mulatta chin.; d) M. mulatta ind.; e) M. nemestrina; f) M. fascicularis; g) C. aethiops (AGM).

DETAILED DESCRIPTION OF THE INVENTION

The invention is based on new isolated polypeptides with ISL activity which inhibit the replication of HIV-1, preferably HIV-1_(SF2) in CD8⁺ -depleted peripheral blood lymphocytes (PBMC) which are prepared from buffy coat of non-retrovirally infected normal human blood samples in an assay (also referred to in the following as an HIV inhibition assay),

a) whereby said CD8⁺ -depleted PBMC are incubated with 0.1 μg of said polypeptide/1.5×10⁶ cells in 150 μl medium for half an hour at 37° C.;

b) said CD8⁺ -depleted PBMC are subsequently infected with HIV-1, preferably with HIV-1_(SF2) by incubating 1.5×10⁶ cells in 150 μl with 50 μl HIV-1 stock solution containing 50 tissue infectious doses₅₀ (TCID₅₀) for 1 h at 37° C.;

c) said infected CD8⁺ -depleted PBMC are washed to remove unbound HIV-1 and, preferably, polypeptide;

d) CD8⁺ -depleted PBMC are cultivated at 37° C. in a 5% CO₂ atmosphere and medium is changed and said polypeptide is added after 3, 6, 9, and 12 days;

e) the amount of HIV-1 in the CD8⁺ -depleted PBMC cell culture supernatants i s determined at days 9 and 12 post infection by serially threefold dilutions of supernatant and inoculation in quadruplicate wells onto 2000 cells in 150 μl medium of a highly susceptible indicator cell line, which must be routinely infectable to an extent of 85% or greater with said HIV-1 preferably the human HTLV-transformed lymphoma cell line MT4;

f) virus replication in each well is determined 8 days post infection by determination of the reverse transcriptase (RT) in the cell culture supernatant (there is preferably applied the Reverse Transcriptase Assay of Boehringer Mannheim GmbH, Biochemica, 68298 Mannheim, Germany, Order No.: 1468 120) of every single well following the instructions of the manufacturer);

g) the tissue culture infectious doses 50 (TCID₅₀) of the CD8⁺ -depleted PBMC cultures is calculated preferably following the method published by Karber (Karber, G. 1931. Assay for statistical analysis of pharmacological experiments. Arch. Exper. Path. V. Pharmakol. 162, 148) according to the formula:

    log TCID.sub.50 =L-d (s-0.5),

wherein

L=log of the lowest virus dilution

d=log of virus dilution

s=sum of virus-positive cell cultures;

h) inhibition of HIV-1 replication in the CD8⁺ -depleted PBMC cultures is calculated by comparison of virus content of cell culture supernatants in an assay according to steps a) to g) and the virus content of an assay according to steps a) to g) where said polypeptide to be tested for inhibition of HIV replication is replaced by buffer without polypeptide (untreated control);

i) inhibition is found if the amount of viral replication in CD8⁺ -depleted PBMC is inhibited in such a way that the amount of virus is about 50% or less, more preferably 10% or less, most preferably 1% or less in comparison to the untreated control,

and the polypeptide

i) is coded by the DNA sequence shown in SEQ ID NO:1 or a sequence complementary to the sequence shown in SEQ ID NO:1,

ii) is coded by a DNA sequence which hybridizes with SEQ ID NO:1 or SEQ ID NO:3 or which hybridizes with a DNA sequence complementary to SEQ ID NO:1 or SEQ ID NO:3, under stringent conditions,

iii) is coded by DNA sequences which, if there was no degeneracy of the genetic code, would hybridize under stringent conditions with the sequences defined in i) or ii),

with the proviso that the polypeptide differs from the polypeptide coded by the DNA sequence shown in SEQ ID NO:3.

Useful polypeptides with ISL activity besides the preferred polypeptides of SEQ ID NO:2 or SEQ ID NO:4 are, for example, the also preferred polypeptides of FIG. 3. FIG. 3 also shows DNA sequences which code for these polypeptides. A further preferred polypeptide is a polypeptide according to the invention wherein amino acid 26 (alanine) is deleted. Such a polypeptide also exists as natural allelic forms in humans and monkeys.

In the examples which follow, a strain of HIV-1 known as HIV-1_(SF2) is used. This is a typical North American/European strain. Its nucleotide sequence is set forth in SEQ ID NO:7, and is also accessible via GenBank Accession Number K02007. Other HIV strains include HIV-1_(SF33) (SEQ ID NO:8), as well as strains set forth in, e.g., Cheng-Mayer et al., J. Virol 64 (1990) 4390-4398; Levy, J. A., et al., Science 232 (1986) 998-1001; Luciw, P. A., et al., Nature 312 (1984) 760-763; Sanchez-Pescador, R, et al., Science 227 (1985) 484-492.

In the assay applied for determination of ISL activity, Ficoll gradient purified and phytohaemagglutinin (PHA) stimulated PBMC were infected with an HIV-1 strain and cultivated. CD8⁺ -depleted PBMC are used because of improved accuracy compared to the use of PBMC. However, it is also possible to use PBMC. CD8⁺ -depleted cells are selected by cell sorting using specific antibodies. Such methods are widely known state of the art. The culture supernatants are tested for their virus content. For this purpose, e.g., determination of the reverse transcriptase or P24 antigen can be carried out. Another possibility is to determine the level of infection of highly susceptible indicator cell lines, referred to a virus-free cell supernatant in each case. In such a test, ISL activity will be found if the substance to be tested causes a reduction of reverse transcriptase activity for at least about 50%, preferably 70%, more preferably 90% or more.

The polypeptide can be defined by its DNA sequence and by the amino acid sequence derived therefrom. The ISL polypeptide can occur in natural allelic variations which differ from individual to individual. Such variations of the amino acids are usually amino acid substitutions. However, they may also be deletions, insertions or additions of amino acids to the total sequence. The ISL protein according to the invention--depending, both in respect of the extent and type, on the cell and cell type in which it is expressed--can be in glycosylated or non-glycosylated form. Polypeptides with ISL activity can easily be identified by the above-described HIV inhibition assay.

FIG. 3 shows a comparison of the DNA and polypeptide sequences of human and different monkey ISL. It was found furthermore that an allelic variant wherein codon 26 (coding for Ala) is deleted exists in all of these species. As can be seen from FIG. 3, ISL polypeptides and nucleic sequences coding therefor, wherein amino acid 7 is Ser or Thr, amino acid 25 is Thr or Ser, amino acid 31 is Cys or Tyr, amino acid 76 is Val or Ile, amino acid 86 is Gly or Ala, amino acid 112 is Ile or Thr, amino acid 121 is Ser or Pro and/or amino acid 128 is Gly or Ala, are preferred. There are also preferred polypeptides in which amino acid 26 is deleted. Such variations can improve the antiviral tumor therapeutic (benign or malignant) and/or immunosuppressive activity of ISL without changing the biological properties in general.

"Polypeptide with ISL activity or ISL" means also proteins with minor amino acid variations but with substantially the same ISL activity. Substantially the same means that the activities are of the same biological properties and the polypeptides show preferably at least 75% homology in amino acid sequence. More preferably, the amino acid sequences are at least 90% identical.

"Indicator cell line" means as lymphoma cell line which must be routinely injectable to an extent of 85% or greater with the HIV-1 strain which is used in the HIV inhibition assay. Preferably, such an indicator cell line is MT4 which is described in Norley, S. G., et al, Biologicals 21 (1993) 251-258 which is incorporated herein by reference. Other useful lymphoma cell lines are described by Cheng-Mayer, C., et al, Virol. 181 (1991) 288-294 and J. Virol. 65 (1991) 6931-6941 which also are incorporated herem by reference. In these publications there are described lymphoma cell lines which are injectable by HIV-1 to a greater or lesser extent. From the named cell lines only those cell lines which are routinely injectable to an extent of 85% or greater with the HIV strain of the HIV inhibition assay are useful.

"ISL activity" denotes the anti-viral action of the tissue culture supernatant of activated and non-activated CD8⁺ lymphocytes of human (ISL) or animal origin (e.g. ISL in the lymphocytes of African green monkeys (ISL-agm)).

"ISL" preferably denotes the molecule whose sequence is shown in SEQ ID NO:1, 2, 3, or 4.

ISL is a polypeptide which is active in its glycosylated or unglycosylated form. The unglycosylated form can be produced by recombinant technology in prokaryotic cells.

ISL is produced by non-activated (small amount) as well as by activated T lymphocytes. ISL binds to CD4⁺ lymphocytes, preferably to the CD4 receptor molecule or to a molecule associated with the CD4 molecule. ISL has suppressed the replication of all HIV-1 and HIV-2 strains tested up to now as well as all previously tested SIV strains. This effect can be observed on CD4⁺ lymphocytes from peripheral blood as well as in a number of human CD4⁺ -positive T cell lymphomas. ISL has an interspecies-specific action since at least the ISL of the African green monkey is capable of suppressing the replication of HIV in human CD4⁺ cells. The function of ISL is not limited by an incompatibility of the MHC locus (major histocompatibility complex) and it does not have a lytic action on cells. ISL is synthesized by the CD8⁺ lymphocytes of asymptomatic patients infected with HIV and less by cells from symptomatic patients. ISL is also produced by activated CD8⁺ cells of healthy blood donors. The extent of ISL synthesis correlates quantitatively with the clinical status of HIV-infected patients. The ISL activity in asymptomatic HIV patients is higher (with a comparable number of activated CD8⁺ lymphocytes) than in symptomatic patients. The antiviral action of ISL is not identical with previously known lymphokines and interferons. ISL activity has also been detected in the cell culture supernatant of activated CD8⁺ lymphocytes of HIV-infected and non-infected chimpanzees as well as of SIV-infected and non-infected African green monkeys, Rhesus monkeys and Sooty mangabees (Ennen, J., et al, Proc. Natl. Acad. Sci. USA 91 (1994) 7207-7211). ISL may be capable of protecting against superinfections with other HIV/SIV strains (Cheng-Mayer, C., et al, J. Virol. 64 (1990) 4390-4398).

A protein with ISL activity is described in Cruikshank et al, Proc. Natl. Acad. Sci. USA 91 (1994) 5109-5113 and WO 94/28134 and is named LCF (see supra). This protein is coded by SEQ ID NO:3 and therefore has the sequence SEQ ID NO:4. Cruikshank refers, for the sequence reported to Accession Number M90391 accorded by GenBank data base. Whereas the protein sequences shown in GenBank and FIG. 2 of Cruikshank are identical, the nucleic acid sequences exhibit a difference in nucleotide 1070. Whereas in the GenBank sequence this nucleotide is T, in FIG. 2 of Cruikshank's publications this nucleotide is G. As TTG does not code for Phe but for Leu, it is clear that G is a typographical error. This is confirmed by cloning of ISL cDNA derived from independent PCR amplifications. From these clones it is clear that the LCF sequence in codon 96 is indeed represented by the sequence TTT. Therefore, nucleotide 1070 clearly is T.

The term "nucleic acid molecule" denotes a polynucleotide which can be, for example, a DNA, RNA, or derivatized active DNA or RNA. DNA and/or RNA molecules are preferred, however.

The term "hybridize under stringent conditions" means that two nucleic acid fragments are capable of hybridization to one another under standard hybridization conditions described in Sambrook et al., "Expression of cloned genes in E. coli" in Molecular Cloning: A laboratory manual (1989) Cold Spring Harbor Laboratory Press, New York, USA, 9.47-9.62 and 11.45-11.61.

More specifically, "stringent conditions" as used herein refer to hybridization in 6.0× SSC at about 45° C., followed by a wash of 2.0× SSC at 50° C. For selection of the stringency the salt concentration in the wash step can be selected, for example from about 2.0× SSC at 50° C., for low stringency, to about 0.2× SSC at 50° C., for high stringency. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperatures, about 22° C., to high stringency conditions at about 65° C.

The term "isolated" as used throughout this application refers to a nucleic acid or polypeptide having an ISL activity and is substantially free of cellular material or culture medium, when produced by recombinant DNA techniques, or chemical precursors or other chemicals, when synthesized chemically. An isolated nucleic acid is preferably free of sequences which naturally flank the nucleic acid (i.e. sequences located at the 5' and the 3' ends of the nucleic acid) in the organism from which the nucleic acid is derived.

ISL can be isolated and purified from activated T cells by affinity chromatography using a monoclonal antibody against ISL. It is also preferred to use other known protein purification techniques, including immunoprecipitation, gel filtration, ion exchange chromatography, chromatofocussing, isoelectric focussing, selective precipitation, electrophoresis, and the like. Fraction isolated during purification procedures can be analyzed for the presence of ISL activity by using ISL specific antibodies.

The polypeptides according to the invention can also be produced by recombinant means, or synthetically. Non-glycosylated ISL polypeptide is obtained when it is produced recombinantly in prokaryotes. With the aid of the nucleic acid sequences provided by the invention it is possible to search for the ISL gene or its variants in genomes of any desired cells (e.g. apart from human cells, also in cells of other mammals), to identify these and to isolate the desired gene coding for the ISL protein. Such processes and suitable hybridization conditions are known to a person skilled in the art and are described, for example, by Sambrook, J., et al., "Expression of cloned genes in E. coli" in Molecular Cloning: A laboratory manual (1989) Cold Spring Harbor Laboratory Press, New York, USA, and B. D. Hames, S. G. Higgins, Nucleic acid hybridisation--a practical approach (1985) IRL Press, Oxford, England. In this case the standard protocols described in these publications are usually used for the experiments.

The use of recombinant DNA technology and the knowledge of the HIV inhibition assay enables the production of numerous active ISL derivatives. Such derivatives can, for example, be modified in individual or several amino acids by substitution, deletion or addition. The derivatization can, for example, be carried out by means of site directed mutagenesis. Such variations can be easily carried out by a person skilled in the art (I. Sambrook, B. D. Hames, loc. cit.). It merely has to be ensured by means of the above-mentioned HIV inhibition assay that the characteristic properties of ISL (inhibition of virus replication) are preserved. The invention therefore in addition concerns an ISL polypeptide which is a product of a prokaryotic or eukaryotic expression of an exogenous DNA.

The invention further concerns an isolated nucleic acid molecule which codes for a polypeptide or active fragment or derivative thereof, which inhibits the replication of HIV-1 in CD8⁺ -depleted PBMC, said PBMC being prepared from buffy coat of non-retrovirally infected normal human blood samples, in the above-mentioned HIV inhibition assay, and wherein said nucleic acid molecule is selected from the group of

i) a DNA molecule as shown in SEQ ID NO:1 or a sequence complementary to the sequence shown in SEQ ID NO:1,

ii) nucleic acid molecules which hybridize with SEQ ID NO:1 or SEQ ID NO:3 or which hybridize with a DNA sequence complementary to SEQ ID NO:1 or SEQ ID NO:3, under stringent conditions,

iii) nucleic acid molecules which, if there was no degeneracy of the genetic code, would hybridize under stringent conditions with the sequences defined in i) or ii),

with the proviso that said isolated nucleic acid molecule is not identical with SEQ ID NO:3.

In a preferred embodiment of the invention, also nucleic acid molecules are disclaimed which code for a polypeptide of SEQ ID NO:4.

With the aid of such nucleic acids coding for an ISL protein, the protein according to the invention can be obtained in a reproducible manner and in large amounts. For expression in prokaryotic or eukaryotic organisms, such as prokaryotic host cells or eukaryotic host cells, the nucleic acid is integrated into suitable expression vectors, according to methods familiar to a person skilled in the art. Such an expression vector preferably contains a regulacatable/induclible promoter. These recombinant vectors are then introduced for the expression into suitable host cells such as, e.g., E. coli as a prokaryotic host cell or Saccharomyces cerevisiae, Terato carcinoma cell line PA-1 sc 9117 (Buttner et al., Mol. Cell. Biol. 11 (1991) 3573-3583), insect cells, CHO or COS cells as eukaryotic host cells and the transformed or transduced host cells are cultured under conditions which allow an expression of the heterologous gene. The isolation of the protein can be carried out according to known methods from the host cell or from the culture supernatant of the host cell. Such methods are described for example by Ausubel I., Frederick M., Current Protocols in Mol. Biol. (1992), John Wiley and Sons, New York. Also in vitro reactivation of the protein may be necessary if it is not found in soluble form in the cell culture.

The detection of transformed or transduced host cells which recombinantly produce the ISL protein and the purification of the protein are preferably carried out by means of antibodies which bind to this protein. Such antibodies can be obtained in a simple manner according to known methods by using the protein according to the invention as an antigen or an immunogen.

The invention therefore in addition concerns the use of the protein with ISL activity according to the invention for the production of antibodies which bind to this protein.

Anti-ISL antibodies are produced by immunization and appropriate vertebrate host with purified ISL or polypeptide derivatives of ISL, preferably with an adjuvant. Said techniques are well-known in the literature and are described, for example, by Harlow and Lane eds., Antibodies: A laboratory manual (1988), Cold Spring Harbor Laboratories Press.

For this, animals which are usually used for this purpose, such as, in particular, sheep, rabbits or mice, are immunized with the protein according to the invention (preferably with the protein of FIG. 3), and subsequently the antiserum is isolated from the immunized animals according to known methods or spleen cells of the immunized animals are fused with immortalized cells, such as e.g. myeloma cells, according to the method of Kohler and Milstein (Nature 256 (1975) 495497). Those cells which produce a monoclonal antibody against the ISL protein are selected from the hybridoma cells obtained in this way and cloned. The monoclonal or polyclonal antibodies obtained in this way can be bound to a support material, such as e.g. cellulose, for an immunoabsorptive purification of ISL. Furthermore, antibodies of this kind can be used for the detection of ISL in samples, such as e.g. cut tissue or body fluids, preferably for the determination of viral infections and virally induced benign and malignant diseases, most preferably for the determination of retroviral infections in mammalian samples. In such assays ISL is bound immunologically to its antibody in the specific step. The invention therefore additionally concerns specific antibodies against the ISL protein preferably the ISL proteins not disclosed by Cruikshank, which are obtainable by immunizing an animal with said ISL protein and isolating the antibodies from the serum or spleen cells of the immunized animals, and their use for the determination of ISL.

The invention in addition concerns the use of a polypeptide defined in the above-mentioned manner including a protein of SEQ ID NO:3, for the production of a pharmaceutical agent and for the treatment of viral infections, preferably retroviral infections such as HIV infections, and for use in therapy of benign and malignant diseases, especially in tumor therapy, most preferably for the treatment of viral-induced tumors.

The protein is processed, if desired together with the usually used auxiliary agents, fillers and/or additives, in a pharmaceutical formulation for the said therapeutic applications.

The invention therefore in addition concerns a therapeutic composition containing a ISL polypeptide according to the invention and if desired together with the auxiliary agents, fillers and/or additives that are usually used.

When the polypeptides according to the invention are applied for therapeutic use, their doses depend on the intended use. To find out the dose and optimize the application, usually such properties of the polypeptide as the half-life and bioavailability and the patient's age and weight will also be taken into account. Optimum therapeutic effectiveness is achieved when the polypeptides according to the invention are applied as soon after the infection as possible, preferably as soon after the first virus peak as possible. Here it is important that a concentration of the polypeptides and substances according to the invention which effectively inhibits virus replication is retained in the blood during the early stage of viral infection. This can be accomplished, for example, by the application of 1 to 1000 μg/patient of the polypeptide according to the invention at 12 to 72 h-intervals. The period of application can be determined, suitably, by the method of determination of virus replication or virus quantity according to the invention or by other methods of virus determinations known to those skilled in the art. The application period may be in the range of from a few days to a few months.

The invention further concerns the use of the ISL genes or fragments thereof, preferably nucleic acid molecules coding for a polypeptide having ISL activity, or activating polynucleotides from the 5' untranslated region, in gene therapy, and in particular, for the production of medicaments for gene therapy, preferably for an antiviral or immunosuppressive therapy, or a therapy of benign or malignant diseases.

Gene therapy of somatic cells can be accomplished by using, e.g., retroviral vectors, other viral vectors, or by non-viral gene transfer (for clarity cf. T. Friedmann, Science 244 (1989) 1275; Morgan 1993, RAC DATA MANAGEMENT REPORT, June 1993).

Vector systems suitable for gene therapy are, for instance, retroviruses (Mulligan, R. C. (1991) in Nobel Symposium 8: Ethiology of human disease at the DNA level (Lindsten, J. and Pattersun Editors) 143-189, Raven Press), adeno associated virus (McLughlin, J. Virol. 62 (1988), 1963), vaccinia virus Moss et al., Ann. Rev. Immunol. 5 (1987) 305), bovine papilloma virus (Rasmussen et al., Methods Enzymol. 139 (1987) 642) or viruses from the group of the herpes viruses such as Epstein Barr virus (Margolskee et al., Mol. Cell. Biol. 8 (1988) 2937) or herpes simplex virus.

There are also known non-viral delivery systems. For this,. usually "nude" nucleic acid, preferably DNA, is used, or nucleic acid together with an auxiliary agent, such as, e.g., transfer reagents (liposomes, dendromers, polylysine-transferrine-conjugates (Felgner et al., Proc. Natl. Acad. Sci. USA 84 (1987) 7413).

There is particularly preferred an ex vivo gene therapy as described, e.g., in W. F. Anderson et al., U.S. Pat. No. 5,399,346. According to this method a polypeptide according to the invention is provided to a human by introducing human cells into a human, said human cells having been treated in vitro to insert therein a DNA segment encoding a polypeptide according to the invention, said human cells expressing in vivo in said human a therapeutically effective amount of said polypeptide. As human cells there are used preferably fibroblasts or autologous hematopoietic stem cells which are characterized preferably by CD3⁺, CD4⁻, CD8⁻. Primitive human hematopoietic progenitor cells, which are characterized by a high expression of CD34 and the absence of CD38 expression, are particularly preferred. However, also more differentiated hematopoietic stem cells such as CD34⁺ and CD38⁺ cells can be used. Such cells are described, e.g., by Terstappen et al., Blood 77 (1991) 1218 or Huang and Terstappen, Nature 360 (1992) 745. For the transfection of fibroblasts it is preferred to use cytomegalovirus (CMV)-based vectors. For the transfection of hematopoietic stem cells it is preferred to use retroviral vectors based on the molony murine leukemia vector (MMLV). Such techniques are described in the state of the art, e.g., in the above-mentioned U.S. Pat. No. 5,399,346 which is incorporated herein by reference. For the regulation of the therapeutic application, the use of a suicide gene system (e.g., tk-Gen (Ganciclovir)) is preferred.

Another preferred method of gene therapy is based on homologous recombination. In this, either the gene coding for the ISL protein can be inserted in one or more copies into the genome of somatic cells and/or the ISL gene endogenously present in the cells can be modulated, preferably activated.

Methods of homologous recombination are described, e.g., in Kucherlapati, Proc. in Nucl. Acids Res. and Mol. Biol. 36 (1989) 301; Thomas et al., Cell 44 (1986) 419-428; Thomas and Capecchi, Cell 51 (1987) 503-512; Doetschman et al., Proc. Natl. Acad. Sci. USA 85 (1988) 8583-8587 and Doetschman et al., Nature 330 (1987) 576-578. In these methods, a portion of DNA to be integrated at a specific site in the genome (gene fragment of ISL) is bound to a targeting DNA. The targeting DNA is a DNA which is complementary (homologous) to a region (preferably within or proximal to the ISL gene) of the genomic DNA. When two homologous portions of a single-stranded DNA (e.g. the targeting DNA and the genomic DNA) are in close proximity to one another they will hybridize and form a double-stranded helix. Then the ISL gene fragment and the targeting DNA can be integrated into the genome by means of occurrence of recombination. This homologous recombination can be carried out both in vitro and in vivo (in the patient).

Preferably, there is used a DNA which codes for a protein having ISL activity, a fragment which inhibits ISL expression (knock-out sequence) or a fragment capable of activating, after integration of the genome of a cell, expression, in this cell, of a protein having ISL activity. Such a fragment may be, for example, a promoter and/or enhancer region which is heterologous to the corresponding ISL region or which, after integration into the ISL gene, activates the actually silent or to a little extent expressed ISL gene transcriptionally and/or translationally.

Thus, by means of this DNA, one or more ISL genes are newly introduced into the target cell, or the essentially transcriptionally silent gene in the genome of a mammalian cell is activated in such fashion that the mammalian cell is enabled to produce endogenous ISL protein. To this end, a DNA construct is inserted into the genome by homologous recombination, the DNA construct comprising the following: a DNA regulatory element capable of stimulating expression of this gene if operatively linked thereto; and one or more DNA target segments which are homologous to a region in this genome, which region is within or proximal to this gene. This construct is inserted into the genome of the mammalian cell in such fashion that the regulatory segment is operatively linked to the gene which codes for the protein having ISL activity. Preferably, the construct further comprises amplifying sequences, especially if genes coding for proteins with ISL activity are inserted into the cell.

For the introduction of ISL genes into the target cells, the construct comprises a regulatory element, one or more ISL genes and one or more target segments. The target segments are chosen in such a way that they hybridize with an appropriate region of the genome, whereby, after homologous recombination, the inserted exogenous ISL genes are expressed.

There are known a large number of processes by which homologous recombination can be initiated. Preferably, homologous recombination takes place during DNA replication or mitosis of the cells. A DNA of this kind can be used for the production of an agent for therapeutic treatment of tumors and viral infection or for the production of homologous or heterologous ISL protein in a host organism.

A further subject-matter of the invention is a method for the determination of ISL polypeptides, nucleic acid sequences, virus-activated cells and ISL expression, preferably in samples of the human body such as human cell preparations, cell supernatants and body fluids such as blood, serum or plasma. Such a determination is useful for the detection of a viral infection, preferably of a mammalian, especially human, cell population. This method is particularly useful for the determination of the activation state of said cells and for the determination of a viral, preferably retroviral, infection of CD4⁺ cells. The diagnostic method is preferably applied immediately or as soon as possible after the first virus peak.

A further subject-matter of the invention is the use of an antibody which binds immunologically to a polypeptide which is obtainable by immunizing an animal with an ISL polypeptide and isolating the antibodies from the serum or spleen cells of the immunized animals, for the determination of the ratio of activated/non-activated CD8⁺ and/or CD4⁺ cells in body fluids, especially in blood, serum or plasma.

Such tests can be provided on the basis of antibodies which are directed against part or all of ISL polypeptides. Such antibodies can be polyclonal or monoclonal antibodies, chimeric antibodies, humanized antibodies or fragments thereof such as F(ab), F(ab)₂, single chain F_(v), or the like. In such an assay, the antibodies are used for immuno-specific recognition of ISL. The further detection (with and without separation of this complex, and subsequent monitoring) can be done by the immuno-assays which are widely known in the state of the art. For instance, the antibody can be labelled by a monitoring agent such as a fluorescence indicator, radio-active or enzymatic labelling.

There is particularly preferred a diagnostic determination of ISL concentrations in serum and other body fluids as well as the number of ISL-producing cells, e.g., for the detection of acute or chronic infections (e.g. even in blood donors) or for monitoring the course of ((retro)viral) infections (e.g. in patients suffering from AIDS), wherein antibodies that are provided with a fluorescent indicated or a radioactive or enzymatic label or with a labelled anti-antibody are reacted, brought into contact with ISL or ISL-producing cells, the antigen/antibody complexes are separated in a known manner and their concentration is determined via the label.

A suitable test method comprises the steps of incubating CD8⁺ T cells, in vitro, with the substance to be tested and determining ISL activity, preferably after 1 to 12 days, by detecting ISL expression according to the invention or by determining ISL polypeptide, preferably by means of an anti-ISL-antibody-based test. Such a test is carried out, for example, in the following manner:

a) commercially available 96-well ELISA plates are coated with monoclonal anti-ISL-antibodies;

b) the sample to be tested for ISL content is added to an antibody coated well for 1 h at room temperature and the well is washed;

c) bound ISL is detected by incubation of an affinity purified polyclonal Goat-anti-ISL-IgG-Preparation followed by an anti-Goat specific horse radish peroxidase labelled antibody and subsequent visualisation with OPD.

Other immunological assays based on the state of the art are also suitable.

It is also possible to provide a test on the basis of the nucleic acid sequences of the ISL protein provided by the invention which can be used to detect nucleic acids preferably RNAS, most preferably mRNAS which code for ISL proteins. Such a test can for example be carried out in cells or cell lysates and by means of nucleic acid diagnostics. In this case the sample to be examined is brought into contact with a probe which would hybridize with the nucleic acid sequence coding for the ISL protein. A hybridization between the probe and nucleic acids from the sample indicates the presence of expressed ISL proteins. Such methods are known to a person skilled in the art and are for example described in WO 89/06698, EP-A 0 200 362, U.S. Pat. No. 2,915,082, EP-A 0 063 879, EP-A 0 173 251, EP-A 0 128 018. In a preferred embodiment of the invention, the nucleic acid of the sample which codes for an ISL protein is amplified before testing, e.g. by the well-known PCR technique. A derivatized (labelled) nucleic acid probe is usually used in the field of nucleic acid diagnostics. This probe is brought into contact with a carrier-bound denatured DNA or RNA from the sample and in this process the temperature, ionic strength, pH value and other buffer conditions are selected in such a way that--depending on the length of the nucleic acid sample and the resulting melting temperature of the expected hybrid--the labelled DNA or RNA can bind to homologous DNA or RNA (hybridization, see also Southern, E. M., J. Mol. Biol. 98 (1975), 503-517; Wahl, G. M. et al., Proc. Natl. Acad. Sci. USA 76 (1979), 3683-3687). Suitable carriers are membranes or carrier materials based on nitrocellulose (e.g. Schleicher and Schull, BA 85, Amersham Hybond, C.), reinforced or bound nitrocellulose in a powder form or nylon membranes derivatized with various functional groups (e.g. nitro group) (e.g. Schleicher and Schull, Nytran; NEN, Gene Screen; Amersham Hybond M.; Pall Biodyne).

The hybridized DNA or RNA is then detected by incubating the carrier, after thorough washing and saturation to prevent unspecific binding, with an antibody or antibody fragment. The antibody or antibody fragment is directed towards the substance incorporated into the nucleic acid probe during the derivatization. The antibody is in turn labelled. It is, however, also possible to use a directly labelled DNA. After incubation with the antibodies, it is washed again in order to only detect specifically bound antibody conjugates. The determination is then carried out via the label of the antibody or antibody fragment according to well-known methods.

The detection of the ISL expression can be carried out, for example

as an in situ hybridization with immobilized whole cells using immobilized tissue smears and isolated metaphase chromosomes,

as a colony hybridization (cells) and plaque hybridization (phages and viruses),

as a Northern hybridization (RNA detection),

as serum analysis (e.g. cell type analysis of cells in serum by slot-blot analysis),

after amplification (e.g. PCR technique).

The invention therefore includes a method for the detection of nucleic acids which code for an ISL protein which is characterized in that the sample to be examined is incubated with a nucleic acid probe which is selected from the group comprising

a) the DNA sequences shown in SEQ ID NO:1 and SEQ ID NO:3 or a complementary sequence to these,

b) nucleic acids which hybridize under stringent conditions with one of the sequences from a),

the nucleic acid probe is incubated with the nucleic acid from the sample and the hybridization of the nucleic acid in the sample and nucleic acid probe is detected, if desired, via a further binding partner.

Thus, ISL is a valuable prognostic marker in viral, benign and malignant disease diagnostics.

Surprisingly, it was found that according to the invention it is not necessary to use an ISL polypeptide or nucleic acid directly for inhibition of the replication of viruses. It is also possible to use substances which induce production of ISL in cells. Such cells preferably are human blood lymphocytes, especially CD8⁺ cells. For induction of ISL production said cells are incubated, in vivo or in vitro, with such activating substances. If activation is performed in vitro the cells are subsequently administered to the patient, e.g., according to the above-mentioned U.S. Pat. No. 5,399,346. According to the invention it is easily possible to identify such substances which activate ISL production.

It has been found that such substances are, e.g., phytohaemagglutinin (PHA), Concanavalin A (ConA), histamine, polypeptides or nucleic acid molecules. Nucleic acid molecules are used as vectors which contain further elements securing expression of said nucleic acid molecules in the target cells. Said elements are known in the state of the art (e.g., regulatory sequences, promoter and/or operator regions). Suitable target cells for transfection with such nucleic acid molecules are preferably human cells, most preferably human blood cells such as lymphocytes, especially CD8⁺ cells.

Therefore, a further subject-matter of the invention is a method for the identification and production of a substance and a therapeutic agent for inhibition of the replication of viruses in a patient. Said method comprises combining with a pharmaceutically acceptable carrier a therapeutically effective amount of a substance which activates expression of a protein with ISL activity in CD8⁺ cells preferably in vivo. The protein the expression of which is activated is preferably a protein with the amino acid sequence shown in SEQ ID NO:2. A suitable substance can be identified in an assay (substance assay) wherein

a) PBMC from healthy blood donors are isolated by Ficoll-gradient separation;

b) CD8⁺ cells are isolated by magnetic cell sorting;

c) the purity of the preparation is tested by FACS analysis;

d) the preparation should have a content of approximately 95% CD8⁺ cells and 5% non-CD8⁺ contaminants to ensure adequate stimulation of CD8⁺ cells;

e) the substance to be tested for induction of expression of ISL activity is added to the cell culture in a concentration range of 1 pM to 10 mM;

f) IL-2 is added to the cell culture (180 U/ml cell culture medium) or to the culture of the transfected cells if the substance is a nucleic acid molecule;

g) after three days medium is completely removed and cells are cultured with IL-2 (180 U/ml cell culture medium) for three days;

h) cell culture supernatant is centrifuged (×1000) to remove cells, sterile filtered, and aliquoted;

and further investigated in the above-mentioned HIV inhibition assay, whereby

i) CD8⁺ -depleted PBMC are incubated with 50 μl of said cell culture supernatants/1.5×10⁶ cells in 150 μl medium for half a hour at 37° C.;

k) said CD8⁺ -depleted PBMC are subsequently infected with HIV-1, preferably with HIV-1_(SF2), by incubating 1.5×10⁶ cells in 150 μl with 50 μl HIV-1 stock solution containing 50 tissue infectious doses₅₀ (TCID₅₀) for 1 h at 37° C.;

l) said infected CD8⁺ -depleted PBMC are washed to remove unbound HIV-1;

m) CD8⁺ -depleted PBMC are cultivated at 37° C. in a 5% CO₂ atmosphere and medium and said cell culture supernatant is replaced after 3, 6, 9, and 12 days;

n) the amount of HIV-1 in the CD8⁺ -depleted PBMC cell culture supernatants is determined at days 9 and 12 post infection by serially threefold dilutions of supernatant and inoculation in quadruplicate wells onto 2000 cells in 150 μl medium of a highly susceptible indicator cell line, which must be routinely injectable to an extent of 85% or greater with said HIV-1 strains, e.g. the human HTLV-transformed lymphoma cell line MT4;

o) virus replication in each well is determined 8 days post infection by determination of the reverse transcriptase (RT) in the cell culture supernatant (Reverse Transcriptase Assay, Boehringer Mannheim GmbH, Biochemica, 68298 Mannheim, Germany, Order No.: 1468 120) of every single well following the instructions of the manufacturer;

p) the tissue culture infectious doses 50 (TCID₅₀) of the CD8⁺ -depleted PBMC cultures is calculated following the method published by Karber (Karber, G. 1931. Assay for statistical analysis of pharmacological experiments. Arch. Exper. Path. V. Pharmakol. 162, 148 according to the formula:

    log TCID.sub.50 =L-d (s-0.5),

wherein

L=log of the lowest virus dilution

d=log of virus dilution

s=sum of virus-positive cell cultures;

q) inhibition of HIV-1 replication in the CD8⁺ -depleted PBMC cultures is calculated by comparison of virus content of cell culture supernatants in an assay according to steps i) to p) and the virus content of an assay according to steps i) to p) where the cell culture supernatant to be tested for inhibition of HIV replication is replaced by normal medium (untreated control);

r) inhibition is found if the amount of viral replication in CD8⁺ -depleted PBMC is inhibited in such a way that the amount of virus is only about 50%, more preferably 10%, most preferably 1% or less in comparison to the untreated control.

A further subject-matter of the invention is a therapeutic composition usefull in treating a pathological condition characterized by excessive viral replication, especially retroviral replication, comprising at least a substance which activates ISL activity in CD8⁺ T cells, and which is characterized by the properties of the above-mentioned substance assay and HIV inhibition assay, and a pharmaceutically acceptable carrier.

Such a substance, which is obtainable and characterized by the above-mentioned substance assay, is useful for the induction and/or activation of ISL in mammalian cells, for the inhibition of the replication of viruses, preferably retroviruses, especially HIV and/or HTLV, for therapeutic treatment of benign and malignant diseases and viral, preferably retroviral, especially HIV and/or HTLV, infections.

It is also particularly preferred to use said substances for therapeutic treatment of such viral infections as soon as possible after the infection, preferably as soon as possible after the first virus peak.

The following examples, sequence listing, and figures are provided to aid the understanding of the present invention, the true scope of which is set forth in the appended claims. It is understood that modifications can be made in the procedures set forth without departing from the spirit of the invention.

Sequence Listing

SEQ ID NO:1 represents the nucleotide of ISL_(agm) (African green monkey) and protein sequence derived therefrom.

SEQ ID NO:2 represents the protein sequence of ISL_(agm).

SEQ ID NO:3 represents the nucleotide of LCF and protein sequence derived therefrom.

SEQ ID NO:4 represents the protein sequence of LCF.

SEQ ID NO:5 represents Primer 1 for ISL cloning.

SEQ ID NO:6 represents Primer 2 for ISL cloning.

SEQ ID NO:7 represents the DNA sequence of HV-1_(SF2).

SEQ ID NO:8 represents the DNA sequence Of HIV-1_(SF33).

EXAMPLE 1 Cloning, Expression and Purification of ISL

1.1 RNA Isolation

5×10⁷ PBMC (human or monkey) were cultured for 48 hours with 10 μg/ml concanavalin A and 180 units/ml IL-2. In order to prepare the RNA, the cells were washed once with PBS and subsequently lysed with 5 ml denaturing solution (RNA isolation kit, Stratagene). After addition of 1 ml Na acetate, 5 ml phenol and 1 ml chioroform/isoamyl alcohol (24:1), the lysate was kept on ice for 15 minutes. The aqueous phase was subsequently admixed with 6 ml isopropanol in order to precipitate the RNA and stored for 2 hours at -20° C. The precipitate was finally washed once with absolute ethanol and dissolved in 150 μl H₂ O. The yield was determined photometrically and was 120 μg.

1.2 CDNA Synthesis

The mixture for CDNA synthesis contained 10 μg RNA, 0.2 μg oligo-dT, 13 mM DTT and 5 μl bulk first-strand reaction mix (first-strand cDNA synthesis kit, Pharnacia) in a volume of 15 μl. The reaction was incubated for 1 hour at 37° C. and subsequently. stored at -20° C. for later use.

1.3 Amplification and Cloning of ISL cDNA

For the amplification of ISL cDNA by means of PCR and for the following cloning, the following oligonucleotides were synthesized:

Primer 1: GCTGCCTCTCATATGGACCTCAACTCCTCCACTGACTCT (SEQ ID NO:5)

Primer 2: GATGGACAGGGATCCCTAGGAGTCTCCAGCAGCTGTGG (SEQ ID NO:6)

The primers introduce additional NdeI or BamHI cleavage sites.

The PCR mixtures (100 μl reaction volumes) each contained 1 μl cDNA (from the synthesis in section 3), 50 pmol primer 1 and 2, 12.5 μmol dNTPs, 10 μl 10×TAQ buffer and 2.5 units Taq polymerase (Perkin-Elmer).

The cycle conditions were 30 sec, 94° C., 1 min, 53° C. and 1 min, 72° C. 35 cycles were carried out.

The PCR products were purified and digested for 16 hours at 37° C. with NdeI and BamHI. For the cloning preparation the vector pET15b (Novagen) was also cleaved with NdeI and BamHI and subsequently purified over an agarose gel.

The ligations were carried out for 2 hours at room temperature in 20 μl mixtures containing 100 ng vector, 25 ng PCR product (Insert), 2 μl 10× ligase buffer and 0.2 μl ligase (New England Biolabs). After transformation by electroporation at 2.5 kvolt, 25 μ farad, 200 ohm (BIO-RAD electroporator) in E. coli DH5, the. cells were placed on ampiciltin-resistant plates.

Recombinant clones were identified by restriction analysis of plasmid preparations (pMISLB) and transformed into the strain BL21-DE3 for the intended protein expression. The cloning of ISL cDNA could be additionally confirmed by determining the nucleotide sequences. The sequences found agreed with the published LCF sequence (Cruikshank et al in Proc. Natl. Acad. Sci. USA, Vol. 91(1994) 5109-5113) apart from a discrepancy in codon 96. In contrast to the published sequence codon 96 is not composed of the base sequence TTG but rather of the sequence TTT and thus codes for leucin and not for phenylalanine. The sequencing of further ISL clones which were derived from independent PCR amplifications clearly showed that the authentic ISL sequence in codon 96 is indeed represented by the sequence TTT.

Proteins homologous to ISL are isolated in an analogous manner from body fluids containing CD8⁺ lymphocytes from animals infected with immunodeficiency virus and in particular from those which are infected without falling ill such as chimpanzees (P. troglodytes), African green monkeys (C. aethiops), sooty mangabees, M. mulatta chin., M. mulatta ind., M. nemestrina or M fascicularis. It is possible to use these proteins and nucleic acids therapeutically and in diagnostics in a similar manner to human ISL.

EXAMPLE 2 Expression and Purification of Recombinant, Soluble ISL

2.1 Human ISL

ISL is expressed aminoterminally in a fusion with a leader of 6 histidine residues in the vector pET15b. 20 ml overnight culture of pMISLB was used in each case to inoculate 2 litres of 2XTY/ampicillin medium. The cultures were shaken at 25° C. and when an OD₆₀₀ of 0.4 had been reached they were induced by addition of 1 ml 1M isopropyl-β-D-thiogalactoside. After a futher 4 hours the bacteria were pelleted and frozen for 14 hours at -70° C.

The pellets were subsequently thawed and washed once with 250 ml PBS. The cells were lysed in 50 ml ice-cold PBS by adjusting the suspension to 1% NP-40, 10 mM EDTA, 0.4 M NaCl and 50 μg/ml lysozyme. After 60 minutes incubation on ice the lysate was freed from insoluble components by centrufugation.

The ISL with 6 histidine residues at the amino terminus (His6-ISL) was purified by means of a chromatographic step. For this the lysate was adjusted to 20 MM MgCl₂, 10 mM imidazole, 0.5 M NaCl and applied to a NI²⁺ -NTA-Agarose (Qiagen) column with a flow rate of 0.1 ml/min. 0.25 ml Ni²⁺ -NTA-Agarose was used per litre initial culture. The column was subsequently washed with 20 volumes PBS, 25 mM imidazole and the His6-ISL was finally eluted with 4 ml PBS, 200 mM imidazole.

His6-ISL fusion protein isolated in this manner had a degree of purity of over 90% after testing in SDS gel electrophoresis. The yields were ca. 5 mg protein per litre initial culture. The purified protein was finally freed from lower molecular impurities such as e.g. imidazole by gel filtration over NAP-10 columns (Pharmacia) and transferred to PBS. Afterwards the protein concentrations were 0.5-1 mg/ml. (Purification scheme see Table 1).

                  TABLE 1                                                          ______________________________________                                         Flow diagram of the purification of ISL                                        ______________________________________                                         E. coli culture B121-DE3 transformed                                            using pMISL-1huB                                                               induce with 1 mM IPTG                                                          lyse the cells (NP-40, EDTA, lysozyme)                                         adjust to 20 mM MgCl.sub.2                                                     10 mM imidazole, 0.5 M NaCl                                                    bind to Ni.sup.2+ -NTA-Agarose                                                 wash with 25 mM imidazole/PBS                                                  elute with 200 mM imidazole/PBS                                                re-buffer in PBS                                                               check the purity by SDS-PAGE                                                   protein determination                                                         ______________________________________                                    

2.2 ISL Derivative LCF

Since LCF has an almost homologous sequence, LCF was specifically cloned from a cDNA library of activated human CD8⁺ lymphocytes, in addition to an experiment on ISL cloning, in order to examine the former for a possible anti-viral efficacy. It could be shown that LCF has an ISL action and is capable of inhibiting HIV as well as SIV replication.

2.3 ISL Derivative from African Green Monkey (ISL-agm)

A protein homologous to human ISL (SEQ ID NO:1) can be isolated from African green monkey (ISL-agm). The nucleotide sequences of human ISL (ISL-hu (SEQ ID NO:2)) differ as shown in the tables below:

                                      TABLE 2a                                     __________________________________________________________________________     Comparison of ISL-hu and ISL-agm DNA sequences                                 Nucleotide                                                                          19                                                                               72                                                                               73                                                                               92                                                                               117                                                                               156                                                                               159                                                                               162                                                                               226                                                                               257                                                                               312                                                                               339                                                                               342                                                                               348                                                                               360                                                                               361                                                                               383                           __________________________________________________________________________     ISL-hu                                                                              T T A G G  T  A  C  G  G  A  C  C  A  G  T  G                               ISL-agm A C T A T C G T A C C A T G A C C                                    __________________________________________________________________________

                  TABLE 2b                                                         ______________________________________                                         Comparison of ISL-hu and ISL-agm protein sequences                               Amino acid                                                                               7        25  31    76  86    121  128                              ______________________________________                                         ISL-hu  S        T     C     V   G     S    G                                    ISL-agm T S Y I A P A                                                        ______________________________________                                    

2.4 ISL Derivatives from Other Monkeys

Nucleic acid and protein sequences of ISL from other monkeys can be isolated in the same manner. A sequence comparison is shown in FIG. 3.

2.5 Recombinant Expression of Fusion-Free ISL in E. coli

The DNA sequence coding for ISL is modified in such fashion as to allow for efficient expression in E. coli.

For expression, an expression plasmid is transfected into a suitable E. coli strain. Such strains are, in the case of the use of an expression plasmid under the control of lac repressor such as the expression plasmid p11379, strains which possess a sufficiently high intracellular concentration of lac repressor. These kinds of strains can be prepared by transfection of a second plasmid such as pREP4 (Diagen GmbH), pUBS 500 or pUBS520 (Brinckmann et al., Gene 85 (1989) 109-114). The applied E. coli strains should preferably have a low protease activity of the cells proper, as is the case, for instance, with E. coli UT5600 (Earhart et al., FEMS Microbiology Letters 6 (1979) 277-280), E. coli BL21 (Grodberg and Dunn, J. Bacteriol. 170 (1988) 1245-1253) or E. coli B. Then, expression cultivation is accomplished in a fashion according to the state of the art, as a protein aggregate, and processed according to the procedures described in EP 0 241 022, EP 0 364 926, EP 0 219 874 and DE-A4037196.

In detail, for example, the following procedure is applied for this purpose: ISL-containing lysates from E.coli fermentations were adjusted to 6 M guanidinium hydrochloride, 100 mM TrisHCl at pH 8, 1 mM EDTA, subsequently adjusted to a pH of 3 to 4 and dialyzed against 4 M guanidinium hydrochloride at pH 3.5. The renaturing of the solubilized protein is then carried out in 1 M arginine at pH 8, 1 mM EDTA, 5 mM GSH (glutathione, reduced) and 0.5 mM GSSG (glutathione, oxidized). ISL can be further purified by usual chromatographic techniques.

2.6 Recombinant Expression of ISL in Mammalian Cells

For this, the cDNA is ligated into a vector in which it is transcribed into mammalian cells, preferably CHO or COS cells, on the basis of a strong promoter-enhancer system. Such promoters and enhancers are mostly from viruses such as SV40, hCMV, polyoma or retroviruses. As an alternative there can also be applied promoter-enhancer systems which are specific to a certain cell type or tissue type, such as, for instance, WAP-, MMTV- or immune globuline promoter, or systems which are inducible, such as, for instance, metallothioneine promoter. This kind of vector supplements the ISL cDNA (if the latter is used) with donor and acceptor signals for RNA processing as well as a signal for poly-A-addition. For example, pCMX-pL1 (Umesono et al., Cell 65 (1991) 1255-1266) is such a suitable vector. Into the one and only EcoRI cleavage site of this vector the cDNA provided with EcoRI linkers is ligated, wherein it is ensured by restriction analysis with the aid of the other cleavage sites in the polylinker of this vector that the cDNA is oriented in reading direction of the CMV promoter. An absolutely analogous procedure is applied when cloning into other vectors, e.g. into pCDNA3 (Invitrogen, San Diego/USA) or pSG5 (Stratagene, LaJolla/USA). The DNA of the so obtained expression plasmids is prepared from E. coli and transfected into the mammalian cells, applying techniques that are specific to the cell types in the particular case (Methods of Enzymology 185 (Goeddel, David V., (ed.), Gene Expression Technology, Academic Press 1991, section V). After transfection, the cells are cultured in MEM (Gibco) without addition of fetal calf serm, whereby ISL is detectable in the cell culture supernatant after 48 hours.

EXAMPLE 3 Testing the Inhibition of HIV Replication by ISL on T Cell Lymphoma Lines, Primary lymphocytes (PBMC) and purified primary CD4⁺ lymphocytes

3.1 Obtaining HIV Virus Stocks for Infecting the Cells

The human immunodeficiency viruses (HIV-1, HIV-2) and the simian immunodeficiency viruses (SIV) replicate in human T cell lymphoma lines as well as in primary CD4⁺ lymphocytes. Cell culture supernatants containing viruses that are obtained from primary lymphocyte cultures (PBMC) usually contain more infectious viruses than those that are obtained from T cell lines. In the case of the HIV 1 strain HIV-1_(SF162) replication is only possible on PBMC since this virus does not replicate in any of the known T cell lymphoma lines. The standard virus supernatants used in all the following experiments was therefore produced on primary lymphocytes. For this PBMC purified by means of a Ficoll gradient and stimulated with phytohaemagglutinin (PHA) (for a detailed description of the method see below) were infected with the HIV-1 strains HIV-1_(SF2), HIV-1_(SF33), HIV-1_(SF162) (Cheng-Mayer, C., et al, J. Virol. 64 (1990) 4390-4398) and the HIV-2 strain HIV-2_(UC3) (Castro, B. A, et al, Virology 178 (1990) 527-534). These viral strains were also passaged earlier exclusively on PBMC and the cell culture supernatants containing viruses were stored at -70° C.

For the infection 120×10⁶ PBMC with a multiplicity of infection (MOI) of 0.1 were incubated for 2 hours at 37° C., the cells were washed with RPMI medium and cultured for 12 days in 40 ml cell culture medium (RPMI 1640, 20% FCS, 2 mM glutamine, 180 U/ml IL-2) corresponding to a cell count of 3×10⁶ /ml cell culture medium. The cell culture medium was changed on the third day upon which the cell count was again adjusted to 3×10⁶ /ml medium. On days 6, 9 and 12 the cell culture supernatant was collected, cells and cell debris were removed by centrifugation, it was sterilized by filtration (0.45 μm pore size) and stored in 0.5 ml aliquots at -70° C.

3.2 Cell Culture Conditions for the Propagation of T Cell Lymphoma Lines

The T cell lymphoma lines H9, CEM, Molt 4 clone 8, MT4 and C8166 are cultured at 37° C. and 5% CO₂ atmosphere in RPMI 1640 medium, supplemented with 10% FCS and 2 mM glutamine. The cells are passaged every third day while changing the medium at the same time and the cell count is adjusted to 1×10⁵ /ml.

3.3 Preparation and Propagation of Primary Blood Lymphocytes (PBMC)

Peripheral blood lymphocytes are prepared from "buffy coats" which have been isolated from normal blood donors. Whole blood is used to prepare PBMC from non-human primates such as African green monkeys and rhesus monkeys. For this the "buffy coat" or the whole blood is layered on a Ficoll-Hypaque gradient and centrifuged for half an hour at 1000×g. The serum supernatant is discarded, the mononuclear cells are collected and washed several times in Hanks medium. The cells (3×10⁶ /ml) are taken up in RPMI 1640, 20% FCS, 2 mM glutamine and stimulated for three days with 9 μg/ml phytohaemagglutinin (PHA) and proliferated with 180 U/ml IL-2. The cells are cultured at 37° C. and 5% CO₂ atmosphere. Then the medium is completely changed and the cells are cultured further without PHA at a cell count of 3×10⁶ /ml culture medium or used in experiments.

3.4 Purifying the CD4⁺ and CD8⁺ Lymphocytes by Means of Magnetic Activated Cell Sorting (MACS)

The lymphocytes isolated by means of Ficoll gradients from a "buffy coat" are resuspended in 500 μl PBS-azide/1×10⁸ cells (phosphate buffered saline without Ca²⁺ and Mg²⁺, 0.01% sodium azide, 5 mM EDTA, pH 7.2). After addition of 20 ml CD8 microbeads/1×10⁷ expected cells (mouse anti-human CD8 antibodies, conjugated with magnetic particles, Miltenyi Biotec GmbH) they are incubated for 15 min at 4° C. 2 mg DTAF/1×10⁷ expected cells (anti-mouse IgG, FITC conjugated, Dianova Company) is added for a further 5 minutes at 4° C. After dilution with 25 ml PBS-azide/1% BSA it is again centrifuged (10 min, 1200 rpm, 4° C.). The supernatant is discarded, the cells are resuspended in 2 ml PBS/1% BSA and the cell suspension is applied to a column which is located in a magnetic separator (Miltenyi Biotec GmbH). CD8⁺ cells to which the CD8 microbeads are coupled are retained in the column, the flow fraction therefore contains all lymphocytes (ca. 80% CD4⁺ cells) except the CD8⁺ cells. After washing the column it is taken out of the holder and the CD8⁺ cell fraction is eluted with PBS-azide/1% BSA The flow fraction and the CD8⁺ cell fraction are centrifuged, resuspended in cell culture medium (RPMI 1640, 20% FCS, 2 mM glutamine, 180 U/ml IL-2), the cell count is adjusted to 3×10⁶ cells/ml and the cells are stimulated with PHA (9 mg/ml). The quality of the separation of the lymphocyte subpopulation is checked by means of FACS.

3.5 Titration of the HIV Virus Stock on Various Host Cells

PBMC, CD4⁺ lymphocytes and the T cell lymphoma lines H9 (Popovic, M., et al, Science 224 (1984) 497-500), Molt 4 clone 8 (Kikukawa, R, et al, J. Virol. 57 (1986) 1159-1162), C8166, MT4 and CEM (obtained from the American Type Culture Collection) were used as host cells. The titrations are carried out in 96-well plates.

a) Titration on PBMC and CD4⁺ Lymphocytes

The virus stocks HIV-1_(SF2), HIV-1_(SF33), HIV-1_(SF162) (Cheng-Mayer, C., Quiroga, M., Tung, J. W., Dina, D. & Levy, J. A. (1990) HIV.2_(UC3) and SIV_(agm) (Kraus, G., et al, Proc. Natl. Acad. Sci. USA 86 (1989) 2892-2896; Baier, M., et al, J. Virol. 63 (1989) 5119-5123) are diluted in three steps and 50 μl of each is pipetted into four independent PBMC or CD4⁺ lymphocyte cultures (1×10⁶ PBMC or CD4⁺ lymphocytes in each case) in 100 μl culture medium and incubated for one hour at 37° C. Viruses that are not cell bound are then removed by washing the cells with culture medium. Medium is changed (removal and addition of 100 μl medium each time) 3, 6, 9 and 12 days after infection. The cell culture supernatants of each individual culture from days 6, 9 and 12 are tested for their virus content by either carrying out

a) a test for reverse transcriptase (according to the instructions of the test manufacturer Boehringer Mannheim GmbH, Germany)

b) a p24-antigen ELISA (according to the instructions of the test manufacturer "Abbott") or

c) infections of highly susceptible indicator cell lines (Ennen, J., et al, Proc. Natl. Acad. Sci. USA 91 (1994) 7207-7211).

b) Titration on T Cell Lymphoma Lines H9, CEM, Molt4/8

The virus stocks are diluted in three steps and 50 μl of each is pipetted into independent cell cultures (in each case 5×10⁴ cells in 100 μl culture medium in U-well 96 cell culture plates) and incubated for 1 hour at 37° C. Washing and testing for infection is carried out according to the method described in 2a). The smaller initial cell count compared to PBMC cultures is due to the proliferative competence of the T cell lymphoma lines that during the course of the titration test grow to the critical cell density in the allotted cell culture volume of the microtitre plates of maximly 250 μl.

c) Titration of the T Cell Lymphoma Lines C8166 and MT4

The T cell lymphoma lines C8166 and MT4 are titrated according to the method described in 2b). The virus test is, however, not carried out using the above-mentioned test systems but the cell cultures are evaluated by light microscopy. In the case of infection by the viral strains HIV-1_(SF2), HIV-1_(SF33) and HIV-2_(UC3) the C8166 and MT4 cells are killed by proliferation of the viruses which can be easily and rapidly identified using a microscope.

3.6 Calculation of the Tissue Culture Infectious Dose 50 (TCID₅₀)

TCID₅₀ is calculated according to the method published by Karber (Karber, G. 1931. Assay for statistical analysis of pharmacological experiments. Arch. Exper. Path. V. Pharmakol. 162, 148) according to the formula:

    log TCID.sub.50 =L-d (s-0.5),

wherein

L=log of the lowest virus dilution

d=log of virus dilution

s=sum of virus-positive cell cultures

3.7 Testing Inhibition of HIV Replication by ISL

The effectiveness of ISL on HIV replication is tested on T cell lymphoma lines as well as on primary lymphocytes. The experiments start with toxicity and dose-finding experiments, in the further course one works with a non-toxic but maximally effective ISL concentration. These tests are carried out separately in 96-well microtitre plates in quadruplicates for each T cell line and PBMC as well as on total PBMC and also on CD4⁺ lymphocytes.

a) Dose-finding and Toxicity Experiments using ISL on T Cell Lines

5×10⁴ cells were incubated for 10 minutes at room temperature with various dilutions (40 μg/ml to 0.15 μg/ml) of ISL. The cells were then infected with the virus stocks mentioned under 1. They were infected in each case for one hour at 37° C. and 5% CO₂ atmosphere with 50 TCID₅₀ which was determined and calculated separately for each cell line (see above under 2.). Non-bound virus was removed by washing the cells with culture medium. The cells were resuspended in cell culture medium and the ISL concentration was adjusted to the initial concentration. In some experiments ISL was added again every day so as to ensure that the ISL concentration remained relatively constant. The medium was changed on days 3, 6, 9 and 12 after infection in the process of which the cell culture supernatants from days 6, 9 and 12 were examined quantitatively for their virus content using the test systems described supra Parallel to this growth curves of the cultures were plotted (counting cells dyed with trypan-blue by means of light microscopy) which gave insight on the toxicity of ISL.

b) Dose-finding and Toxicity Experiments using ISL on Primary PBMC

The experiments were carried out using total PBMC as well as purified CD4⁺ lymphocytes. 1×10⁶ cells were incubated with ISL under the conditions described supra, infected with HIV and the cell culture supernatants were quantitatively examined for their virus content on days 6, 9 and 12 after infection. The toxicity of ISL on PBMC was determined with the aid of trypan blue staining and counting the cells by light microscopy.

c) Tests on the Mechanism of Action of ISL

It was examined whether ISL develops its inhibitory effect at the level of de-novo infection or during persisting HIV replication. Experiments were carried out for this in which the fully effective dose of ISL in the cell culture was present only during the one hour infection period. Parallel to this a constant ISL concentration was additionally maintained in the cell culture medium during the whole test period in another experimental mixture. This experiment allows a decision whether ISL inhibits the HIV infection or HIV replication or whether it is effective at both levels (FIGS. 1 and 2).

EXAMPLE 4 Qualitative and Quantitative Detection of ISL in Body Fluids and Cell Culture Supernatants

ISL was detected by means of a modified enzyme linked immunosorbent assay (ELISA). For this monoclonal antibodies against ISL are prepared. These are absorbed onto the wells of an ELISA plate. The liquid to be tested for its ISL content is incubated, the monoclonal antibodies specifically react with ISL which is immobilized. Bound ISL is detected by means of an affinity-purified polyclonal anti-ISL antibody (obtained by immunizing a goat with ISL) which itself is made visible by means of a colour reaction using an anti-goat antibody coupled to peroxidase.

EXAMPLE 5 Direct Detection of Cells Producing ISL

The monoclonal antibody against ISL described above is used for the direct detection of cells producing ISL either for diagnostic purposes in the case of cells from patients or for cells in tissue culture. For this the cells are fixed with methanol while at the same time disrupting the cell membrane and incubated with the monoclonal anti-ISL antibody labelled with fluorescein isothiocyanate (FITC). It is evaluated with the aid of a fluorescence activated cell sorter FACS).

List of References

Ausubel I., Frederick M., Current Protocols in Mol. Biol. (1992), John Wiley and Sons, New York

Baier, M., et al, J. Virol. 63 (1989) 5119-5123

Blackbourn et al, Journal of Medical Primatology No. 23 (1994) 343-354

Brinckmann et al, Gene 85 (1989) 109-114

Buttner et al, Mol. Cell. Biol. 11 (1991) 3573-3583

Castro, B. A., et al, Virology 178 (1990) 527-534

Castro, Walker et al, Cellular Immunology 132 (1991) 246-255

Center, D. M., et al., J. Lab. Clin. Med. 125 (1995) 167-171

Chen et al, AIDS Research and Human Retroviruses Vol. 9, No. 11, 1079-1086

Cheng-Mayer, C., et al, J. Virol. 64 (1990) 4390-4398

Cheng-Mayer, C., et al, Virol. 65 (1991) 6931-6941

Cheng-Mayer, C., et al, Virol. 181 (1991) 288-294

Cruikshank and Center, Journal of Immunology 128 (1982) 2569-2574

Cruikshank et al, Journal of Immunology 138 (1987) 3817-3823

Cruikshank et al, Journal of Immunology 146 (1991) 2928-2934

Cruikshank et al, Proc. Natl. Acad. Sci. USA, Vol. 91 (1994) 5109-5113

DE-A4037 196

Doetschman et al, Nature 330 (1987) 576-578

Doetschman et al, Proc. Natl. Acad. Sci. USA 85 (1988) 8583-8587

Earhart et al, FEMS Microbiology Letters 6 (1979) 277-280

Ennen, J., et al, Proc. Natl. Acad. Sci. USA 91 (1994) 7207-7211

EP-A0 173 251

EP-A 0 063 879

EP-A 0 128 018

EP-A 0 200 362

EP0 219 874

EP 0 241 022

EP 0 364 926

Felgner et al, Proc. Natl. Acad. Sci. USA 84 (1987) 7413

Friedmann, T., Science 244 (1989) 1275

Goeddel, David V. (ed.), Methods of Enzymology 185, Gene Expression Technology, Academic Press 1991, section V

Grodberg and Dunn, J. Bacteriol. 170 (1988) 1245-1253

Hames, B. D., and Higgins, S. G., Nucleic acid hybridisation--a practical approach (1985) IRL Press, Oxford, England

Harlow and Lane eds., Antibodies: A laboratory manual (1988), Cold Spring Harbor Laboratories Press

Hsueh, Walker, et al, Cellular Immunology 159 (1994) 271-279

Huang and Terstappen, Nature 360 (1992) 745

Joag et al, Virology 200 (1994) 436-446

Kannagi et al, The Journal of Immunology, Vol. 140 (1988), No. 7, 2237-2242

Karber, G. 1931. Assay for statistical analysis of pharmacological experiments. Arch. Exper.

Path. V. Pharmakol. 162, 148

Kikukawa, R., et al, J. Virol. 57 (1986) 1159-1162

Knuchel, Bednarik, et al, Journal of Acquired Immune Deficiency Syndromes, No. 7 (1994) 438-446

Kohler and Milstein, Nature 256 (1975) 495-497

Kraus, G., et al, Proc. Natl. Acad. Sci. USA 86 (1989) 2892-2896

Kucherlapati, Proc. in Nucl. Acids Res. and Mol. Biol. 36 (1989) 301

Levy, J. A., et al, Science 232 (1986) 998-1001

Luciw, P. A., et al, Nature 312 (1984) 760-763

Mackewicz et al, AIDS Research and Human Retroviruses, Vol. 8 (1992), No. 6, 1039-1050

Mackewicz et at Lancet, 344 (1994) 1671-1673

Margolskee et al, Mol. Cell. Biol. 8 (1988) 2937

McLughlin J. Virol. 62 (1988), 1963

Morgan 1993, RAC DATA MANAGEMENT REPORT, June 1993

Moss et al, Ann. Rev. Immunol. 5 (1987) 305

Mulligan, R. C. (1991) in Nobel Symposium 8: Ethiology of human disease at the DNA level (Lindsten, J. and Pattersun Editors) 143-189, Raven Press

Norley, S. G., et al, Biologicals 21 (1993) 251-258

Popovic, M., et al, Science 224 (1984) 497-500

Rand, Cruikshank, et al, J. Exp. Med. 173 (1991) 1521-1528

Rasmussen et at Methods Enzymol. 139 (1987) 642

Sambrook et al, "Expression of cloned genes in E. coli" in Molecular Cloning: A laboratory manual (1989) Cold Spring Harbor Laboratory Press, New York, USA

Sanchez-Pescador, R., et al, Science 227 (1985) 484-492

Southern, E. M., J. Mol. Biol. 98 (1975) 503-517

Terstappen et al, Blood 77 (1991) 1218

Thomas and Capecchi, Cell 51 (1987) 503-512

Thomas et al, Cell 44 (1986) 419-428

Umesono et al, Cell 65 (1991) 1255-1266

U.S. Pat. No. 2,915,082

U.S. Pat. No. 5,399,346

U.S. Pat. No. 5,399,346

Wahl, G. M., et al, Proc. Natl. Acad. Sci. USA 76 (1979) 3683-3687

Walker, C. M., and Leighly, J. A, Immunology, Vol. 66 (1989) 628-630

Walker, Erickson, et al, Journal of Virology, Vol. 65 (1991) No. 11, 5921-5927

Walker, Moody, et al, Science, Vol. 234 (1986) 1563-1566

Walker, Thomson-Honnebier, et al, Cellular Immunology 137 (1991) 420-428

WO 89/06698

WO 93/0883

WO 94/23058

WO 94/28134

    __________________________________________________________________________     #             SEQUENCE LISTING                                                    - -  - - <160> NUMBER OF SEQ ID NOS: 8                                         - - <210> SEQ ID NO 1                                                         <211> LENGTH: 393                                                              <212> TYPE: DNA                                                                <213> ORGANISM: african green monkey                                            - - <400> SEQUENCE: 1                                                          - - atgcccgacc tcaactccac cactgactct gcagcctcag cctctgcagc ca -             #gtgatgtt     60                                                                  - - tctgtagaat cctcagcaga ggccacagtc tacacggtga cactggagaa ga -             #tgtctgca    120                                                                  - - gggctgggct tcagcctgga aggagggaag ggctccctgc atggagacaa gc -             #ctctcacc    180                                                                  - - attaacagga ttttcaaagg agcagcctca gaacaaagtg agacaatcca gc -             #ctggagat    240                                                                  - - gaaatcttgc agctggctgg cactgccatg cagggcctca cacggtttga ag -             #cctggaac    300                                                                  - - atcatcaagg ccctgcctga tggacctgtc acgattgtaa ttaggaggaa aa -             #gcctccaa    360                                                                  - - cccaaggaaa ccacagctgc tgcagactcc tag       - #                  -       #        393                                                                      - -  - - <210> SEQ ID NO 2                                                    <211> LENGTH: 130                                                              <212> TYPE: PRT                                                                <213> ORGANISM: african green monkey                                            - - <400> SEQUENCE: 2                                                          - - Met Pro Asp Leu Asn Ser Thr Thr Asp Ser Al - #a Ala Ser Ala Ser         Ala                                                                                1               5 - #                 10 - #                 15              - - Ala Ser Asp Val Ser Val Glu Ser Ser Ala Gl - #u Ala Thr Val Tyr Thr                    20     - #             25     - #             30                   - - Val Thr Leu Glu Lys Met Ser Ala Gly Leu Gl - #y Phe Ser Leu Glu Gly                35         - #         40         - #         45                       - - Gly Lys Gly Ser Leu His Gly Asp Lys Pro Le - #u Thr Ile Asn Arg Ile            50             - #     55             - #     60                           - - Phe Lys Gly Ala Ala Ser Glu Gln Ser Glu Th - #r Ile Gln Pro Gly Asp        65                 - # 70                 - # 75                 - # 80        - - Glu Ile Leu Gln Leu Ala Gly Thr Ala Met Gl - #n Gly Leu Thr Arg Phe                        85 - #                 90 - #                 95               - - Glu Ala Trp Asn Ile Ile Lys Ala Leu Pro As - #p Gly Pro Val Thr Ile                   100      - #           105      - #           110                   - - Val Ile Arg Arg Lys Ser Leu Gln Pro Lys Gl - #u Thr Thr Ala Ala Ala               115          - #       120          - #       125                       - - Asp Ser                                                                       130                                                                         - -  - - <210> SEQ ID NO 3                                                    <211> LENGTH: 2151                                                             <212> TYPE: DNA                                                                <213> ORGANISM: Homo sapiens                                                    - - <400> SEQUENCE: 3                                                          - - ttcctcgaga gctgtcaaca caggctgagg aatctcaagg cccagtgctc aa -              #gatgccta     60                                                                  - - gccagcgagc acggagcttc cccctgacca ggtcccagtc ctgtgagacg aa -             #gctacttg    120                                                                  - - acgaaaagac cagcaaactc tattctatca ccagccagtg tcatcggctg tc -             #atgaaatc    180                                                                  - - cttgctgtgc cttccatctt ctatctcctg tgcccagact ccctgcatcc cc -             #aaggaagg    240                                                                  - - ggcatctcca acatcatcat ccaacgaaga ctcagctgca aatggttctg ct -             #gaaacatc    300                                                                  - - tgccttggac acggggttct cgctcaacct ttcagagctg agagaatata ca -             #gagggtct    360                                                                  - - cacggaagcc aaggaagacg atgatgggga ccacagttcc ttcagtctgg tc -             #agtccgtt    420                                                                  - - atctccctgc tgagctcaga agaattaaaa aaactcatcg aggaggtgaa gg -             #ttctggat    480                                                                  - - gaagcaacat taaagcaatt agacggcatc catgtcacca tcttacacaa gg -             #aggaaggt    540                                                                  - - gctggtcttg ggttcagctt ggcaggagga gcagatctag aaaacaaggt ga -             #ttacggtt    600                                                                  - - cacagagtgt ttccaaatgg gctggcctcc caggaaggga ctattcagaa gg -             #gcaatgag    660                                                                  - - gttctttcca tcaacggcaa gtctctcaag gggaccacgc accatgatgc ct -             #tggccatc    720                                                                  - - ctccgccaag ctcgagagcc caggcaagct gtgattgtca caaggaagct ga -             #ctccagag    780                                                                  - - ccatgcccga cctcaactcc tccactgact ctgcagcctc agcctctgca gc -             #cagtgatg    840                                                                  - - tttctgtaga atctacagca gaggccacag tctgcacggt gacactggag aa -             #gatgtcgg    900                                                                  - - cagggctggg cttcagcctg gaaggaggga agggctccct acacggagac aa -             #gcctctca    960                                                                  - - ccattaacag gattttcaaa ggagcagcct cagaacaaag tgagacagtc ca -             #gcctggag   1020                                                                  - - atgaaatctt gcagctgggt ggcactgcca tgcagggcct cacacggttt ga -             #agcctgga   1080                                                                  - - acatcatcaa ggcactgcct gatggacctg tcacgattgt catcaggaga aa -             #aagcctcc   1140                                                                  - - agtccaagga aaccacagct gctggagact cctaggcagg acatgctgaa gc -             #caaagcca   1200                                                                  - - ataacacaca gctaacacac agctcccata accgctgatt ctcagggtct ct -             #gctgccgc   1260                                                                  - - cccacccaga tgggggaaag cacaggtggg cttcccagtg gctgctgccc ag -             #gcccagac   1320                                                                  - - cttctaggac gccacccagc aaaaggttgt tcctaaaata agggcagagt ca -             #cactgggg   1380                                                                  - - cagctgatac aaattgcaga ctgtgtaaaa agagagctta atgataatat tg -             #tggtgcca   1440                                                                  - - caaataaaat ggatttatta gaatttcata tgacattcat gcctggcttc gc -             #aaaatgtt   1500                                                                  - - tcaagtactg taactgtgtc atgattcacc cccaaacagt gacatttatt tt -             #tctcatga   1560                                                                  - - atctgcaatg tgggcagaga ttggaatggg cagctcatct ctgtcccact tg -             #gcatcagc   1620                                                                  - - tggcgtcatg caaagtcatg caaaggctgg gaccacgtga gatcattcac tc -             #atacatct   1680                                                                  - - ggccgttgat gttggctggg aactcacctg gggctgctgg cctgaatgct ta -             #taggtggc   1740                                                                  - - ctctccttgt ggcctgggct cctcacaaca tggtgtctgg attcccagga tg -             #agcatccc   1800                                                                  - - aggatcgcaa gagccacgta gaagctgcat cttgtttata cctttgcctt gg -             #aagttgca   1860                                                                  - - tggcatcacc tccaccatac tccatcagtt agagctgaca caaacctgcc tg -             #ggtttaag   1920                                                                  - - gggagaggaa atattgctgg ggtcatttat gaaaaataca gtttgtcaca tg -             #aaacattt   1980                                                                  - - gcaaaattgt ttttggttgg attggagaag taatcctagg gaagggtggt gg -             #agccagta   2040                                                                  - - aatagaggag tacaggtgaa gcaccaagct caaagcgtgg acaggtgtgc cg -             #acagaagg   2100                                                                  - - aaccagcgtg tatatgaggg tatcaaataa aattgctact acttacctac c - #                2151                                                                         - -  - - <210> SEQ ID NO 4                                                    <211> LENGTH: 130                                                              <212> TYPE: PRT                                                                <213> ORGANISM: Homo sapiens                                                    - - <400> SEQUENCE: 4                                                          - - Met Pro Asp Leu Asn Ser Ser Thr Asp Ser Al - #a Ala Ser Ala Ser Ala         1               5 - #                 10 - #                 15               - - Ala Ser Asp Val Ser Val Glu Ser Thr Ala Gl - #u Ala Thr Val Cys Thr                    20     - #             25     - #             30                   - - Val Thr Leu Glu Lys Met Ser Ala Gly Leu Gl - #y Phe Ser Leu Glu Gly                35         - #         40         - #         45                       - - Gly Lys Gly Ser Leu His Gly Asp Lys Pro Le - #u Thr Ile Asn Arg Ile            50             - #     55             - #     60                           - - Phe Lys Gly Ala Ala Ser Glu Gln Ser Glu Th - #r Val Gln Pro Gly Asp        65                 - # 70                 - # 75                 - # 80        - - Glu Ile Leu Gln Leu Gly Gly Thr Ala Met Gl - #n Gly Leu Thr Arg Phe                        85 - #                 90 - #                 95               - - Glu Ala Trp Asn Ile Ile Lys Ala Leu Pro As - #p Gly Pro Val Thr Ile                   100      - #           105      - #           110                   - - Val Ile Arg Arg Lys Ser Leu Gln Ser Lys Gl - #u Thr Thr Ala Ala Gly               115          - #       120          - #       125                       - - Asp Ser                                                                       130                                                                         - -  - - <210> SEQ ID NO 5                                                    <211> LENGTH: 39                                                               <212> TYPE: DNA                                                                <213> ORGANISM: Artificial Sequence                                            <220> FEATURE:                                                                 <223> OTHER INFORMATION: Description of Artificial - #Sequence:primer           - - <400> SEQUENCE: 5                                                          - - gctgcctctc atatggacct caactcctcc actgactct      - #                       - #    39                                                                       - -  - - <210> SEQ ID NO 6                                                    <211> LENGTH: 38                                                               <212> TYPE: DNA                                                                <213> ORGANISM: Artificial Sequence                                            <220> FEATURE:                                                                 <223> OTHER INFORMATION: Description of Artificial - #Sequence:primer           - - <400> SEQUENCE: 6                                                          - - gatggacagg gatccctagg agtctccagc agctgtgg      - #                       - #     38                                                                       - -  - - <210> SEQ ID NO 7                                                    <211> LENGTH: 9737                                                             <212> TYPE: DNA                                                                <213> ORGANISM: Human immunodeficiency virus type - #1                          - - <400> SEQUENCE: 7                                                          - - ctggaagggc taatttggtc ccaaagaaga caagagatcc ttgatctgtg ga -              #tctaccac     60                                                                  - - acacaaggct acttccctga ttggcagaat tacacaccag ggccagggat ca -             #gatatcca    120                                                                  - - ctgacctttg gatggtgctt caagctagta ccagttgagc cagagaaggt ag -             #aagaggcc    180                                                                  - - aatgaaggag agaacaacag cttgttacac cctatgagcc tgcatgggat gg -             #aggacgcg    240                                                                  - - gagaaagaag tgttagtgtg gaggtttgac agcaaactag catttcatca ca -             #tggcccga    300                                                                  - - gagctgcatc cggagtacta caaagactgc tgacatcgag ctttctacaa gg -             #gactttcc    360                                                                  - - gctggggact ttccagggag gcgtggcctg ggcgggactg gggagtggcg tc -             #cctcagat    420                                                                  - - gctgcatata agcagctgct ttttgcctgt actgggtctc tctggttaga cc -             #agatctga    480                                                                  - - gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aa -             #gcttgcct    540                                                                  - - tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta ga -             #gatccctc    600                                                                  - - agaccctttt agtcagtgtg gaaaaatctc tagcagtggc gcccgaacag gg -             #acgcgaaa    660                                                                  - - gcgaaagtag aaccagagga gctctctcga cgcaggactc ggcttgctga ag -             #cgcgcaca    720                                                                  - - gcaagaggcg aggggcggcg actggtgagt acgccaattt ttgactagcg ga -             #ggctagaa    780                                                                  - - ggagagagag atgggtgcga gagcgtcggt attaagcggg ggagaattag at -             #aaatggga    840                                                                  - - aaaaattcgg ttaaggccag ggggaaagaa aaaatataag ttaaaacata ta -             #gtatgggc    900                                                                  - - aagcagggag ctagaacgat tcgcagtcaa tcctggcctg ttagaaacat ca -             #gaaggctg    960                                                                  - - cagacaaata ttgggacagc tacagccatc ccttcagaca ggatcagaag aa -             #cttagatc   1020                                                                  - - attatataat acagtagcaa ccctctattg tgtacatcaa aggatagatg ta -             #aaagacac   1080                                                                  - - caaggaagct ttagagaaga tagaggaaga gcaaaacaaa agtaagaaaa ag -             #gcacagca   1140                                                                  - - agcagcagct gcagctggca caggaaacag cagccaggtc agccaaaatt ac -             #cctatagt   1200                                                                  - - gcagaaccta caggggcaaa tggtacatca ggccatatca cctagaactt ta -             #aatgcatg   1260                                                                  - - ggtaaaagta gtagaagaaa aggctttcag cccagaagta atacccatgt tt -             #tcagcatt   1320                                                                  - - atcagaagga gccaccccac aagatttaaa caccatgcta aacacagtgg gg -             #ggacatca   1380                                                                  - - agcagccatg caaatgttaa aagagactat caatgaggaa gctgcagaat gg -             #gatagagt   1440                                                                  - - gcatccagtg catgcagggc ctattgcacc aggccaaatg agagaaccaa gg -             #ggaagtga   1500                                                                  - - catagcagga actactagta cccttcagga acaaatagga tggatgacaa at -             #aatccacc   1560                                                                  - - tatcccagta ggagaaatct ataaaagatg gataatcctg ggattaaata aa -             #atagtaag   1620                                                                  - - aatgtatagc cctaccagca ttctggacat aagacaagga ccaaaggaac cc -             #tttagaga   1680                                                                  - - ttatgtagac cggttctata aaactctaag agccgaacaa gcttcacagg at -             #gtaaaaaa   1740                                                                  - - ttggatgaca gaaaccttgt tggtccaaaa tgcaaaccca gattgtaaga ct -             #attttaaa   1800                                                                  - - agcattggga ccagcagcta cactagaaga aatgatgaca gcatgtcagg ga -             #gtgggggg   1860                                                                  - - acccggccat aaagcaagag ttttggctga agccatgagc caagtaacaa at -             #ccagctaa   1920                                                                  - - cataatgatg cagagaggca attttaggaa ccaaagaaag actgttaagt gt -             #ttcaattg   1980                                                                  - - tggcaaagaa gggcacatag ccaaaaattg cagggcccct aggaaaaagg gc -             #tgttggag   2040                                                                  - - atgtggaagg gaaggacacc aaatgaaaga ttgcactgag agacaggcta at -             #tttttagg   2100                                                                  - - gaagatctgg ccttcctaca agggaaggcc agggaatttt cttcagagca ga -             #ccagagcc   2160                                                                  - - aacagcccca ccagaagaga gcttcaggtt tggggaggag aaaacaactc cc -             #tctcagaa   2220                                                                  - - gcaggagccg atagacaagg aactgtatcc tttaacttcc ctcagatcac tc -             #tttggcaa   2280                                                                  - - cgacccctcg tcacaataag gatagggggg caactaaagg aagctctatt ag -             #atacagga   2340                                                                  - - gcagatgata cagtattaga agaaatgaat ttgccaggaa aatggaaacc aa -             #aaatgata   2400                                                                  - - gggggaattg gaggttttat caaagtaaga cagtacgatc agatacctgt ag -             #aaatctgt   2460                                                                  - - ggacataaag ctataggtac agtattagta ggacctacac ctgtcaacat aa -             #ttggaaga   2520                                                                  - - aatctgttga ctcagattgg ttgtacttta aatttcccca ttagtcctat tg -             #aaactgta   2580                                                                  - - ccagtaaaat taaagccagg aatggatggc ccaaaagtta agcaatggcc at -             #tgacagaa   2640                                                                  - - gaaaaaataa aagcattagt agagatatgt acagaaatgg aaaaggaagg ga -             #aaatttca   2700                                                                  - - aaaattgggc ctgaaaatcc atacaatact ccagtatttg ctataaagaa aa -             #aagacagt   2760                                                                  - - actaaatgga gaaaactagt agatttcaga gaacttaata aaagaactca ag -             #acttctgg   2820                                                                  - - gaagttcagt taggaatacc acaccccgca gggttaaaaa agaaaaaatc ag -             #taacagta   2880                                                                  - - ttggatgtgg gtgatgcata cttttcagtt cccttagata aagactttag aa -             #agtatact   2940                                                                  - - gcatttacca tacctagtat aaacaatgag acaccaggga ttagatatca gt -             #acaatgtg   3000                                                                  - - ctgccacagg gatggaaagg atcaccagca atattccaaa gtagcatgac aa -             #aaatctta   3060                                                                  - - gagcctttta gaaaacagaa tccagacata gttatctatc aatacatgga tg -             #atttgtat   3120                                                                  - - gtaggatctg acttagaaat agggcagcat agaacaaaaa tagaggaact ga -             #gacagcat   3180                                                                  - - ctgttgaggt ggggatttac cacaccagac aaaaaacatc agaaagaacc tc -             #cattcctt   3240                                                                  - - tggatgggtt atgaactcca tcctgataaa tggacagtac agcctataat gc -             #tgccagaa   3300                                                                  - - aaagacagct ggactgtcaa tgacatacag aagttagtgg gaaaattgaa tt -             #gggcaagt   3360                                                                  - - cagatttatg cagggattaa agtaaagcag ttatgtaaac tccttagagg aa -             #ccaaagca   3420                                                                  - - ctaacagaag taataccact aacagaagaa gcagagctag aactggcaga aa -             #acagggag   3480                                                                  - - attctaaaag aaccagtaca tgaagtatat tatgacccat caaaagactt ag -             #tagcagaa   3540                                                                  - - atacagaagc aggggcaagg ccaatggaca tatcaaattt atcaagagcc at -             #ttaaaaat   3600                                                                  - - ctgaaaacag gaaagtatgc aaggatgagg ggtgcccaca ctaatgatgt aa -             #aacagtta   3660                                                                  - - acagaggcag tgcaaaaagt atccacagaa agcatagtaa tatggggaaa ga -             #ttcctaaa   3720                                                                  - - tttaaactac ccatacaaaa ggaaacatgg gaagcatggt ggatggagta tt -             #ggcaagct   3780                                                                  - - acctggattc ctgagtggga gtttgtcaat acccctccct tagtgaaatt at -             #ggtaccag   3840                                                                  - - ttagagaaag aacccatagt aggagcagaa actttctatg tagatggggc ag -             #ctaatagg   3900                                                                  - - gagactaaat taggaaaagc aggatatgtt actgacagag gaagacaaaa ag -             #ttgtctcc   3960                                                                  - - atagctgaca caacaaatca gaagactgaa ttacaagcaa ttcatctagc tt -             #tgcaggat   4020                                                                  - - tcgggattag aagtaaacat agtaacagac tcacaatatg cattaggaat ca -             #ttcaagca   4080                                                                  - - caaccagata agagtgaatc agagttagtc agtcaaataa tagagcagtt aa -             #taaaaaag   4140                                                                  - - gaaaaggtct acctggcatg ggtaccagca cacaaaggaa ttggaggaaa tg -             #aacaagta   4200                                                                  - - gataaattag tcagtgctgg aatcaggaaa gtactatttt tgaatggaat ag -             #ataaggcc   4260                                                                  - - caagaagaac atgagaaata tcacagtaat tggagagcaa tggctagtga tt -             #ttaacctg   4320                                                                  - - ccacctgtag tagcaaaaga aatagtagcc agctgtgata aatgtcagct aa -             #aaggagaa   4380                                                                  - - gccatgcatg gacaagtaga ctgtagtcca ggaatatggc aactagattg ta -             #cacatcta   4440                                                                  - - gaaggaaaaa ttatcctggt agcagttcat gtagccagtg gatatataga ag -             #cagaagtt   4500                                                                  - - attccagcag agacagggca ggaaacagca tattttctct taaaattagc ag -             #gaagatgg   4560                                                                  - - ccagtaaaaa caatacatac agacaatggc agcaatttca ccagtactac gg -             #ttaaggcc   4620                                                                  - - gcctgttggt gggcagggat caagcaggaa tttggcattc cctacaatcc cc -             #aaagtcaa   4680                                                                  - - ggagtagtag aatctatgaa taatgaatta aagaaaatta taggacaggt aa -             #gagatcag   4740                                                                  - - gctgaacacc ttaagacagc agtacaaatg gcagtattca tccacaattt ta -             #aaagaaaa   4800                                                                  - - ggggggattg ggggatacag tgcaggggaa agaatagtag acataatagc aa -             #cagacata   4860                                                                  - - caaactaaag aactacaaaa gcaaattaca aaaattcaaa attttcgggt tt -             #attacagg   4920                                                                  - - gacaacaaag atcccctttg gaaaggacca gcaaagcttc tctggaaagg tg -             #aaggggca   4980                                                                  - - gtagtaatac aagataatag tgacataaaa gtagtgccaa gaagaaaagc aa -             #aaatcatt   5040                                                                  - - agggattatg gaaaacagat ggcaggtgat gattgtgtgg caagtagaca gg -             #atgaggat   5100                                                                  - - tagaacatgg aaaagtttag taaaacacca tatgtatatt tcaaagaaag ct -             #aaaggatg   5160                                                                  - - gttttataga catcactatg aaagtactca tccaagagta agttcagaag ta -             #cacatccc   5220                                                                  - - cctaggggat gctaaattgg taataacaac atattggggt ctgcatacag ga -             #gaaagaga   5280                                                                  - - atggcatttg ggccagggag tcgccataga atggaggaaa aagaaatata gc -             #acacaagt   5340                                                                  - - agaccctggc ctagcagacc aactaattca tctgcattat tttgattgtt tt -             #tcagaatc   5400                                                                  - - tgctataaaa aatgccatat taggatatag agttagtcct aggtgtgaat at -             #caagcagg   5460                                                                  - - acataacaag gtaggatctc tacaatactt ggcactagca gcattaataa ca -             #ccaaaaaa   5520                                                                  - - gacaaagcca cctttgccta gtgttaagaa actgacagag gatagatgga ac -             #aagcccca   5580                                                                  - - gaagaccaag ggccacagag ggagccatac aatgaatgga cactagagct tt -             #tagaggag   5640                                                                  - - cttaagagag aagctgttag acattttcct aggccatggc tccatagctt ag -             #gacaatat   5700                                                                  - - atctatgaaa cttatgggga tacttgggca ggagtggaag ccataataag aa -             #ttctgcaa   5760                                                                  - - caactgctgt ttattcattt cagaattggg tgtcaacata gcagaatagg ca -             #ttattcaa   5820                                                                  - - cagaggagag caagaagaaa tggagccagt agatcctaat ctagagccct gg -             #aagcatcc   5880                                                                  - - aggaagtcag cctaggactg cttgtaacaa ttgctattgt aaaaagtgtt gc -             #tttcattg   5940                                                                  - - ctacgcgtgt ttcacaagaa aaggcttagg catctcctat ggcaggaaga ag -             #cggagaca   6000                                                                  - - gcgacgaaga gctcctcagg acagtcagac tcatcaagct tctctatcaa ag -             #cagtaagt   6060                                                                  - - agtaaatgta atgcaatctt tacaaatatt agcaatagta tcattagtag ta -             #gtagcaat   6120                                                                  - - aatagcaata gttgtgtgga ccatagtact catagaatat aggaaaatat ta -             #agacaaag   6180                                                                  - - aaaatagaca gattaattga tagaataaga gaaaaagcag aagacagtgg ca -             #atgaaagt   6240                                                                  - - gaaggggacc aggaggaatt atcagcactt gtggagatgg ggcaccttgc tc -             #cttgggat   6300                                                                  - - gttgatgatc tgtagtgcta cagaaaaatt gtgggtcaca gtttattatg ga -             #gtacctgt   6360                                                                  - - gtggaaagaa gcaactacca ctctattttg tgcatcagat gctagagcat at -             #gatacaga   6420                                                                  - - ggtacataat gtttgggcca cacatgcctg tgtacccaca gaccccaacc ca -             #caagaagt   6480                                                                  - - agtattggga aatgtgacag aaaattttaa catgtggaaa aataacatgg ta -             #gaacagat   6540                                                                  - - gcaggaggat ataatcagtt tatgggatca aagcctaaag ccatgtgtaa aa -             #ttaacccc   6600                                                                  - - actctgtgtt actttaaatt gcactgattt ggggaaggct actaatacca at -             #agtagtaa   6660                                                                  - - ttggaaagaa gaaataaaag gagaaataaa aaactgctct ttcaatatca cc -             #acaagcat   6720                                                                  - - aagagataag attcagaaag aaaatgcact ttttcgtaac cttgatgtag ta -             #ccaataga   6780                                                                  - - taatgctagt actactacca actataccaa ctataggttg atacattgta ac -             #agatcagt   6840                                                                  - - cattacacag gcctgtccaa aggtatcatt tgagccaatt cccatacatt at -             #tgtacccc   6900                                                                  - - ggctggtttt gcgattctaa agtgtaataa taaaacgttc aatggaaaag ga -             #ccatgtac   6960                                                                  - - aaatgtcagc acagtacaat gtacacatgg aattaggcca atagtgtcaa ct -             #caactgct   7020                                                                  - - gttaaatggc agtctagcag aagaagaggt agtaattaga tctgacaatt tc -             #acgaacaa   7080                                                                  - - tgctaaaacc ataatagtac agctgaatga atctgtagca attaactgta ca -             #agacccaa   7140                                                                  - - caacaataca agaaaaagta tctatatagg accagggaga gcatttcata ca -             #acaggaag   7200                                                                  - - aataatagga gatataagaa aagcacattg taacattagt agagcacaat gg -             #aataacac   7260                                                                  - - tttagaacag atagttaaaa aattaagaga acagtttggg aataataaaa ca -             #atagtctt   7320                                                                  - - taatcaatcc tcaggagggg acccagaaat tgtaatgcac agttttaatt gt -             #agagggga   7380                                                                  - - atttttctac tgtaatacaa cacaactgtt taataataca tggaggttaa at -             #cacactga   7440                                                                  - - aggaactaaa ggaaatgaca caatcatact cccatgtaga ataaaacaaa tt -             #ataaacat   7500                                                                  - - gtggcaggaa gtaggaaaag caatgtatgc ccctcccatt ggaggacaaa tt -             #agttgttc   7560                                                                  - - atcaaatatt acagggctgc tattaacaag agatggtggt acaaatgtaa ct -             #aatgacac   7620                                                                  - - cgaggtcttc agacctggag gaggagatat gagggacaat tggagaagtg aa -             #ttatataa   7680                                                                  - - atataaagta ataaaaattg aaccattagg aatagcaccc accaaggcaa ag -             #agaagagt   7740                                                                  - - ggtgcagaga gaaaaaagag cagtgggaat agtaggagct atgttccttg gg -             #ttcttggg   7800                                                                  - - agcagcagga agcactatgg gcgcagtgtc attgacgctg acggtacagg cc -             #agacaatt   7860                                                                  - - attgtctggt atagtgcaac agcagaacaa tttgctgagg gctattgagg cg -             #caacaaca   7920                                                                  - - tctgttgcaa ctcacagtct ggggcatcaa gcagctccag gcaagagtcc tg -             #gctgtgga   7980                                                                  - - aagataccta agggatcaac agctcctagg gatttggggt tgctctggaa aa -             #ctcatttg   8040                                                                  - - caccactgct gtgccttgga atgctagttg gagtaataaa tctctggaag ac -             #atttggga   8100                                                                  - - taacatgacc tggatgcagt gggaaagaga aattgacaat tacacaaaca ca -             #atatacac   8160                                                                  - - cttacttgaa gaatcgcaga accaacaaga aaagaatgaa caagaattat ta -             #gaattgga   8220                                                                  - - taagtgggca agtttgtgga attggtttag cataacaaac tggctgtggt at -             #ataaagat   8280                                                                  - - attcataatg atagtaggag gcttggtagg tttaagaata gtttttgctg tg -             #ctttctat   8340                                                                  - - agtgaataga gttaggcagg gatactcacc attgtcattt cagacccgcc tc -             #ccagtccc   8400                                                                  - - gaggggaccc gacaggcccg acggaatcga agaagaaggt ggagagagag ac -             #agagacag   8460                                                                  - - atccgttcga ttagtggatg gattcttagc acttatctgg gaagatctgc gg -             #agcctgtg   8520                                                                  - - cctcttcagc taccgccgct tgagagactt actcttgatt gcagcgagga ct -             #gtggaaat   8580                                                                  - - tctggggcac agggggtggg aagccctcaa atattggtgg agtctcctgc ag -             #tattggat   8640                                                                  - - tcaggaacta aagaatagtg ctgttagctg gctcaacgcc acagctatag ca -             #gtaactga   8700                                                                  - - ggggacagat agggttatag aagtagcaca aagagcttat agagctattc tc -             #cacataca   8760                                                                  - - tagaagaatt agacagggct tggaaaggct tttgctataa gatgggtggc aa -             #gtggtcaa   8820                                                                  - - aacgtagtat gggtggatgg tctgctataa gggaaagaat gagacgagct ga -             #gccacgag   8880                                                                  - - ctgagccagc agcagatggg gtgggagcag tatctcgaga cctggaaaaa ca -             #tggagcaa   8940                                                                  - - tcacaagtag caatacagca gctactaatg ctgattgtgc ctggctagaa gc -             #acaagagg   9000                                                                  - - aggaagaggt gggttttcca gtcagacctc aggtaccttt aagaccaatg ac -             #ttacaagg   9060                                                                  - - cagctttaga tattagccac tttttaaaag aaaagggggg actggaaggg ct -             #aatttggt   9120                                                                  - - cccaaagaag acaagagatc cttgatctgt ggatctacca cacacaaggc ta -             #cttccctg   9180                                                                  - - attggcagaa ttacacacca gggccaggga tcagatatcc actgaccttt gg -             #atggtgct   9240                                                                  - - tcaagctagt accagttgag ccagagaagg tagaagaggc caatgaagga ga -             #gaacaaca   9300                                                                  - - gcttgttaca ccctatgagc ctgcatggga tggaggacgc ggagaaagaa gt -             #gttagtgt   9360                                                                  - - ggaggtttga cagcaaacta gcatttcatc acatggcccg agagctgcat cc -             #ggagtact   9420                                                                  - - acaaagactg ctgacatcga gctttctaca agggactttc cgctggggac tt -             #tccaggga   9480                                                                  - - ggcgtggcct gggcgggact ggggagtggc gtccctcaga tgctgcatat aa -             #gcagctgc   9540                                                                  - - tttttgcctg tactgggtct ctctggttag accagatctg agcctgggag ct -             #ctctggct   9600                                                                  - - aactagggaa cccactgctt aagcctcaat aaagcttgcc ttgagtgctt ca -             #agtagtgt   9660                                                                  - - gtgcccgtct gttgtgtgac tctggtaact agagatccct cagacccttt ta -             #gtcagtgt   9720                                                                  - - ggaaaaatct ctagcag             - #                  - #                       - # 9737                                                                   - -  - - <210> SEQ ID NO 8                                                    <211> LENGTH: 4527                                                             <212> TYPE: DNA                                                                <213> ORGANISM: Human immunodeficiency virus type - #1                          - - <400> SEQUENCE: 8                                                          - - gaattctgca acaactgctg tttattcatt tcagaattgg gtgccaacat ag -              #cagaatag     60                                                                  - - gcattactcg acagaggaga gcaagaaatg gagccagtag atcctaacct ag -             #agccctgg    120                                                                  - - aagcatccag gaagtcagcc taggactgct tgcaccaact gctattgtaa aa -             #agtgttgc    180                                                                  - - tttcattgcc aagtttgctt cataacaaaa ggcttaggca tatcctatgg ca -             #ggaagaag    240                                                                  - - cggaggcagc gacagagagc tcctgacagc agtcagaatc atcaagattc tc -             #tatcaaag    300                                                                  - - cagtaagtag tacatgtaat gtaatcttta acaatattag caatagtagc aa -             #tagtagta    360                                                                  - - gtaacaataa tagcaatagt tatatggacc atagtattaa taaaatatag ga -             #aaatatta    420                                                                  - - agacaaagaa aaatagacag attaattgat agaataagag aaagagcaga ag -             #acagtggc    480                                                                  - - aatgagagcg agggagacca ggaagaatta tcagtgcttg tggagatggg gc -             #acgatgct    540                                                                  - - ccttgggatg ttaatgatct gtagtgctgc agaaaatttg tgggtcacag tt -             #tattatgg    600                                                                  - - ggtacctgtg tggaaagatg caaccactac tctattttgt gcatcagatg ct -             #aaagcata    660                                                                  - - tgatacagag gtacataatg tttgggccac acatgcctgt gtacccacag ac -             #cccaaccc    720                                                                  - - ccaagaagta gtattgggaa atgtgacaga aaattttaac atgtggaaaa at -             #aacatggt    780                                                                  - - agaccagatg catgaggata tagtcagttt atgggatcaa agcctaaagc ca -             #tgtgtaaa    840                                                                  - - attaacccca ctctgtgtta ctttaaattg cactgattat ttggggaatg ct -             #actaatac    900                                                                  - - caacaatagt agtgggggaa cggtggagaa agaagaaata aaaaactgct ct -             #ttcaatat    960                                                                  - - caccacaggc ataagagata aggtacagaa ggcatatgca tatttttata aa -             #cttgatgt   1020                                                                  - - agtaccaata gatgatgata atactaatac cagctatagg ttgatacatt gt -             #aattcctc   1080                                                                  - - agtcattaca cagacctgtc caaaggtatc ctttgagcca attcctatac at -             #tattgtgc   1140                                                                  - - cccggctggt tttgcgattc taaagtgtaa taataagaag ttcagtggaa aa -             #ggtcaatg   1200                                                                  - - tacaaatgtc agcacagtac aatgtacaca tggaattaag ccagtagtgt ca -             #actcaact   1260                                                                  - - gctgttaaat ggcagtctag cagaagaaga ggtagtaatt agatctgaca at -             #ttcacgaa   1320                                                                  - - caatgctaaa accatattag tacagctgaa tgtatctgta gaaattaatt gt -             #acaagacc   1380                                                                  - - caacaacaat agaagaagaa ggataactag tggaccaggg aaagtacttt at -             #acaacagg   1440                                                                  - - agaaataata ggagatataa gaaaagcata ttgtaacatt agtagagcaa aa -             #tggaataa   1500                                                                  - - aactttagaa caggtagcta caaaattaag agaacaattt gggaataaaa ca -             #atagtatt   1560                                                                  - - taaacaatcc tcaggaggag acccagaaat tgtaatgcac agttttaatt gt -             #agagggga   1620                                                                  - - atttttctac tgtaatacaa caaaactgtt taatagtact tggaatgaaa at -             #agtacttg   1680                                                                  - - gaatgctact ggaaatgaca ctatcacact cccatgtaga ataaaacaaa tt -             #ataaacat   1740                                                                  - - gtggcaggaa gtaggaaaag caatgtatgc ccctcccatc gaaggacaaa tt -             #agatgttc   1800                                                                  - - atcaaatatt acagggctgc tattaacaag agatggtggt ggtgacaaga ac -             #agtaccac   1860                                                                  - - cgagatcttt agacctgcag gaggaaatat gaaggacaat tggagaagtg aa -             #ttatataa   1920                                                                  - - atataaagta gtaaaaattg aaccattagg agtagcaccc accaaggcaa ag -             #agaagagt   1980                                                                  - - ggtgcaaaga gaaaaaagag cagtgggagt gataggagct atgttccttg gg -             #ttcttggg   2040                                                                  - - agcagcagga agcactatgg gcgcagcgtc aataacgctg acggtacagg cc -             #agaaaact   2100                                                                  - - attgtctggt atagtgcaac agcagaacaa tctgctgaga gctattgagg cg -             #caacagca   2160                                                                  - - tctgttgcaa ctcacagtct ggggcatcaa gcagctccag gcaagagtcc tg -             #gctgtgga   2220                                                                  - - aagataccta agagatcaac agctcctagg gatttggggt tgctctggaa aa -             #ctcatttg   2280                                                                  - - caccactact gtgccttgga atactagttg gagtaataaa tctctggata ag -             #atttggaa   2340                                                                  - - taacatgact tggatggagt gggaaagaga aattgacaat tacacaagct ta -             #atatacac   2400                                                                  - - cttacttgaa gaatcgcaaa accaacaaga aaagaatgaa caagagttat tg -             #gaattgga   2460                                                                  - - taagtgggca agtttgtgga attggtttag cataacaaac tggctgtggt at -             #ataagaat   2520                                                                  - - attcataatg atagtaggag gcttgatagg tttaagaata atttttgctg tg -             #ctttctat   2580                                                                  - - agtaaataga gttaggcagg gatactcacc attatcattt cagaccctca tc -             #ccagccca   2640                                                                  - - gaggggaccc gacaggcccg aaggaatcga agaaggaggt ggagagagag ac -             #agagacag   2700                                                                  - - atccactcga ttagtgaacg gattcttagc actgttctgg gacgatcttc gg -             #agcctgtg   2760                                                                  - - cctcttcagc taccaccgct tgacagactt actcttgatt gtagcgagga tt -             #gtggaact   2820                                                                  - - tctgggacgc agggggtggg aagtcctcaa atattggtgg aatctcctgc tg -             #tattggag   2880                                                                  - - tcaggaacta aagaatagtg ctgttagctt gctcaacgcc acagctatag ca -             #gtagctga   2940                                                                  - - agggacagat agggttatag aagtagtaca aagagtgggt agagctattc tc -             #cacatacc   3000                                                                  - - tacaagaata agacagggct ttgaaagggc tttgctataa gatgggtggc aa -             #gtggtcaa   3060                                                                  - - aaagtaaaat gggatggcct gctgtaaggg aaagaatgaa gcgagctgag cc -             #agcagcag   3120                                                                  - - atggggtggg agcagcatct agagacctgg aaaaacatgg agcactcaca ag -             #tagcaata   3180                                                                  - - cagcagctac taatgctgat tgtgcctggc tagaagcaca agaggatgag ga -             #ggtgggtt   3240                                                                  - - ttccagtcaa acctcaggta cctttaagac caatgactta caaagcagct tt -             #agatctta   3300                                                                  - - gccacttttt aaaagaaaag gggggactgg aagggctagt ttactcccaa aa -             #aagacaag   3360                                                                  - - atatccttga tctgtggatc taccacacac aaggctactt ccctgattgg ca -             #gaactaca   3420                                                                  - - caccagggcc aggggtcaga tttccactga cctttggatg gtgcttcaag tt -             #agtaccag   3480                                                                  - - tagagccaga gaaagtagaa gaggccaatg aaggagagaa caacagcttg tt -             #acacccta   3540                                                                  - - tgagcctgca tgggatggag gacccggaga aagaagtgtt agtgtggaag tt -             #tgacagcc   3600                                                                  - - acctagcatt tcgtcacatg gcccgagagc tgcatccgga gtactacaaa ga -             #ctgctgac   3660                                                                  - - atcgagtttt ctacaaggga ctttccgctg gggactttcc aggggaggcg tg -             #gcctgggc   3720                                                                  - - gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt tt -             #gcctgtac   3780                                                                  - - ggggtctctc tggttagacc agatctgagc ctgggagctc tctggctaac ta -             #gggaaccc   3840                                                                  - - actgcttaag cctcaataaa gcttgccttg agtgcttcaa gtagtgtgtg cc -             #cgtctgtt   3900                                                                  - - gtgtgactct gctatctaga gatccctcag acccttttag tcagtgtgga aa -             #atctctag   3960                                                                  - - caatatataa atatatcttt gacctttaca gcatatggta ataacttaaa aa -             #ttatatgc   4020                                                                  - - ctaattgtga aaaaaaaaaa agaaaaaaga actcttcttg ccagaatcca ag -             #tcccatga   4080                                                                  - - aagtagccaa tgctgtctca ttagttagta agctaatgga aatgttgcca gc -             #atttcttt   4140                                                                  - - cagtgtctag aaaacagagt gtgcaatgtg ccaagtcttc actgatttat tt -             #ttgtaagc   4200                                                                  - - agcagtgtaa taaacccaaa gaagccaaaa aagcaaattt ttaaaaaata aa -             #tattcatt   4260                                                                  - - tgctatcaag atgggtatga cctttttacc caagcctatt actgacaatt ca -             #gaaagact   4320                                                                  - - atgtgaaata gtcactcatt tatcttaatt gcatttgcag gtactaccac ca -             #ctcaagtt   4380                                                                  - - ttaaaatgtt tttaaacact caagtttgca ttcctttagc ttttatacaa ga -             #aaccacat   4440                                                                  - - tattttacat acatattaat tattttctga cctttcagga aaacccaata at -             #ataaatct   4500                                                                  - - acaaaatgaa ataatactca agaattc          - #                  - #                4527                                                                    __________________________________________________________________________ 

I claim:
 1. An isolated polypeptide that inhibits the replication of HIV-1 in CD8⁺ -depleted peripheral blood lymphocytes (PBMC) prepared from buffy coat of non-retrovirally infected normal human blood samples, wherein said polypeptide has the sequence shown in SEQ ID NO:2 or wherein said peptide deviates from the peptide of SEQ ID NO:2 only by one or more but not all of the substitutions or deletion selected from the group consisting of: amino acid 7 is Ser, amino acid 25 is Thr, amino acid 31 is Cys, amino acid 76 is Val, amino acid 112 is Thr, amino acid 121 is Ser, amino acid 128 is Gly, and amino acid 26 is deleted.
 2. An isolated polypeptide according to claim 1, wherein said polypeptide is the polypeptide of SEQ ID NO:2.
 3. A composition comprising the polypeptide of claim 1 in combination with a pharmaceutically acceptable carrier.
 4. The composition according to claim 3, wherein said polypeptide has the sequence of SEQ ID NO:2.
 5. A process for the production of a polypeptide, comprising the steps of inserting a DNA construct into the genome of a cell by homologous recombination, wherein said DNA construct comprises a) a DNA regulatory element capable of stimulating the expression of an exogenous gene which codes for the polypeptide of claim 1, when operatively linked thereto, b) a gene which codes for the polypeptide of claim 1, wherein the gene is operatively linked to said DNA regulatory element, and c) one or more DNA target segments homologous to a region in the genome of said cell, wherein the target segments are operatively linked to the DNA regulatory element and the gene, to produce transformed cells, culturing the cells under conditions which result in the expression of said polypetide, and recovering said polypeptide from the cells or cell culture supernatant.
 6. A DNA construct which can be introduced into the genome of cells by homologous recombination, comprising:a DNA regulatory element capable of modulating expression of a gene when operatively linked thereto, a gene coding for a polypeptide according to claim 1 operatively linked to the DNA regulatory element, and one or more DNA target segments homologous to a region in the genome of said cells, wherein the target segments are operatively linked to the DNA regulatory element and the gene.
 7. The DNA construct according to claim 6, wherein the construct is a retroviral vector or a non-viral DNA.
 8. An isolated nucleic acid molecule selected from the group consisting of:i) the nucleic acid molecule shown in SEQ ID NO:1 or the nucleic acid molecule complementary thereto. ii) a nucleic acid molecule which codes for the same polypeptide as the nucleic acid molecule of SEQ ID NO:1 or the nucleic acid molecule complementary thereto, and; iii) a nucleic acid molecule that codes for a polypeptide, wherein said polypeptide deviates from the polypeptide of SEQ ID NO:2 only by one or more but not all of the substitutions or deletion selected from the group consisting of: amino acid 7 is Ser, amino acid 25 is Thr, amino acid 31 is Cys, amino acid 76 is Val, amino acid 112 is Thr, amino acid 121 is Ser, amino acid 128 is Gly, and amino acid 26 is deleted.
 9. The isolated nucleic acid molecule according to claim 8, wherein said nucleic acid molecule has the sequence shown in SEQ ID NO:1 or the nucleic acid molecule complementary thereto.
 10. A method for expressing a polypeptide in a prokaryotic or eukaryotic host cell, comprisingtransfecting or transforming a host cell with the nucleic acid molecule of claim 8, culturing the transfected or transformed host cells under conditions suitable for the expression of said polypeptide; and isolating the polypeptide from the host cell or the culture supernatant of the host cell.
 11. A recombinant expression vector comprising the nucleic acid of claim
 8. 12. A prokaryotic or eukaryotic host cell which is transfected with the nucleic acid molecule of claim
 8. 13. The host cell according to claim 12, wherein said host cell is E.coli or a mammalian cell line. 